Description
Technical Field

This invention is in the fields of genetic engineering, plant biology, and bacteriology.



Background Art 

In the past decade, the science of genetic engineering has developed rapidly. A variety of processes
are known for inserting a heterologous gene into bacteria, whereby the bacteria become capable of efficient
expression of the inserted genes. Such processes normally involve the use of plasmids which may be
cleaved at one or more selected cleavage sites by restriction endonucleases, discussed below. Typically, a
gene of interest is obtained by cleaving one piece of DNA and the resulting DNA fragment is mixed with a
fragment obtained by cleaving a vector such as a plasmid. The different strands of DNA are then connected
("ligated") to each other to form a reconstituted plasmid. See, for example, U.S. Patents 4,237,224 (Cohen
and Boyer, 1980); 4,264,731 (Shine, 1981); 4,273,875 (Manis, 1981); 4,322,499 (Baxter et al, 1982), and
4,336,336 (Silhavy et al, 1982). A variety of other reference works are also available. Some of these works
describe the natural processes whereby DNA is transcribed into messenger RNA (mRNA) and mRNA is
translated into protein; see, e.g., Stryer, 1981 (note: all references cited herein, other than patents, are listed
with citations after the Examples); Lehninger, 1975. Other works describe methods and products of genetic
manipulation; see, e.g., Maniatis et al, 1982; Setlow and Hollaender, 1979.

Most of the genetic engineering work performed to date involves the insertion of genes into various
types of cells, primarily bacteria such as E. coli, various other types of microorganisms such as yeast, and
mammalian cells. However, many of the techniques and substances used for genetic engineering of animal
cells and microorganisms are not directly applicable to genetic engineering involving plants.

As used herein, the term "plant" refers to a multicellular differentiated organism that is capable of
photosynthesis, such as angiosperms and multicellular algae. This does not include microorganisms, such
as bacteria, yeast and fungi. However, the term "plant cells" includes any cell derived from a plant; this
includes undifferentiated tissue such as callus or crown gall tumor, as well as plant seeds, propagules,
pollen, and plant embryos.

A variety of plant genes have been isolated, some of which have been published and/or are publicly
available. Such genes include the soybean actin gene (Shah et al, 1982), corn zein (Pederson et al, 1982),
soybean leghemoglobin (Hyldig-Nielsen et al, 1982), and soybean storage proteins (Fischer and Goldberg,
1982).



The Regions of a Gene 

The expression of a gene involves the creation of a polypeptide which is coded for by the gene. This
process involves at least two steps: part of the gene is transcribed to form messenger RNA, and part of the
mRNA is translated into a polypeptide. Although the processes of transcription and translation are not fully
understood, it is believed that the transcription of a DNA sequence into mRNA is controlled by several
regions of DNA. Each region is a series of bases (i.e., a series of nucleotide residues comprising adenosine
(A), thymidine (T), cytidine (C), and guanosine (G)) which are in a desired sequence. Regions which are
usually present in a eucaryotic gene are shown on Figure 1. These regions have been assigned names for
use herein, and are briefly discussed below. It should be noted that a variety of terms are used in the
literature, which describes these regions in much more detail.

An association region 2 causes RNA polymerase to associate with the segment of DNA. Transcription
does not occur at association region 2; instead, the RNA polymerase normally travels along an intervening
region 4 for an appropriate distance, such as about 100-300 bases, after it is activated by association region
2.

A transcription initiation sequence 6 directs the RNA polymerase to begin synthesis of mRNA. After it
recognizes the appropriate signal, the RNA polymerase is believed to begin the synthesis of mRNA an
appropriate distance, such as about 20 to about 30 bases, beyond the transcription initiation sequence 6.
This is represented in Figure 1 by intervening region 8.

The foregoing sequences are referred to collectively as the promoter region of the gene. 

The next sequence of DNA is transcribed by RNA polymerase into messenger RNA which is not
translated into protein. In general, the 5' end of a strand of mRNA attaches to a ribosome. In bacterial cells,
this attachment is facilitated by a sequence of bases called a "ribosome binding site" (RBS). However, in
eucaryotic cells, no such RBS sequence is known to exist. Regardless of whether an RBS exists in a strand
of mRNA, the mRNA moves through the ribosome until a "start codon" is encountered. The start codon is
usually the series of three bases, AUG; rarely, the codon GUG may cause the initiation of translation. The
non-translated portion of mRNA located between the 5' end of the mRNA and the start codon is referred to
as the 5' non-translated region 10 of the mRNA. The corresponding sequence in the DNA is also referred to
herein as 5' non-translated region 12. The specific series of bases in this sequence is not believed to be of
great importance to the expression of the gene; however, the presence of a premature start codon might
affect the translation of the mRNA (see Kozak, 1978).

A promoter sequence may be significantly more complex than described above; for example, certain
promoters present in bacteria contain regulatory sequences that are often referred to as "operators." Such
complex promoters may contain one or more sequences which are involved in induction or repression of
the gene. One example is the lac operon, which normally does not promote transcription of certain
lactose-utilizing enzymes unless lactose is present in the cell. Another example is the trp operator, which
does not promote transcription or translation of certain tryptophan-creating enzymes if an excess of
tryptophan is present in the cell. See, e.g., Miller and Reznikoff, 1982.

The next sequence of bases is usually called the coding sequence or the structural sequence 14 (in the
DNA molecule) or 16 (in the mRNA molecule). As mentioned above, the translation of a polypeptide begins
when the mRNA start codon, usually AUG, reaches the translation mechanism in the ribosome. The start
codon directs the ribosome to begin connecting a series of amino acids to each other by peptide bonds to
form a polypeptide, starting with methionine, which always forms the amino terminal end of the polypeptide
(the methionine residue may be subsequently removed from the polypeptide by other enzymes). The bases
which follow the AUG start codon are divided into sets of 3, each of which is a codon. The "reading frame",
which specifies how the bases are grouped together into sets of 3, is determined by the start codon. Each
codon codes for the addition of a specific amino acid to the polypeptide being formed. The entire genetic
code (there are 64 different codons) has been solved; see, e.g., Lehninger, supra, at p. 962. For example,
CUA is the codon for the amino acid leucine; GGU specifies glycine, and UGU specifies cysteine.
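
The reading-frame logic described above can be illustrated with a short sketch. It is not part of the
original disclosure: the codon table is deliberately partial (only the codons named in the text), and the
example mRNA and the function name `translate` are hypothetical, chosen only for illustration.

```python
# Minimal sketch: translate an mRNA from its first AUG, reading codons in frame.
# The codon table is partial on purpose (only codons mentioned in the text).
CODON_TABLE = {
    "AUG": "Met",  # start codon; also codes for methionine
    "CUA": "Leu",
    "GGU": "Gly",
    "UGU": "Cys",
}
STOP_CODONS = {"UAA", "UAG", "UGA"}


def translate(mrna):
    """Return the amino acids coded between the first AUG and the first stop codon."""
    start = mrna.find("AUG")  # the start codon fixes the reading frame
    if start == -1:
        return []
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        codon = mrna[i:i + 3]
        if codon in STOP_CODONS:
            break  # the polypeptide is released at the stop codon
        peptide.append(CODON_TABLE.get(codon, "???"))  # "???" = not in the partial table
    return peptide


# Hypothetical mRNA: a short 5' non-translated leader, then AUG CUA GGU UGU UAA.
example = "GCACC" + "AUGCUAGGUUGUUAA" + "GCC"
print(translate(example))  # ['Met', 'Leu', 'Gly', 'Cys']
```

The sketch ignores the rare GUG start codon and any premature AUG in the leader; a real translation
routine would need the full 64-codon table.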

Three of the codons (UAA, UAG, and UGA) are "stop" codons; when a stop codon reaches the
translation mechanism of a ribosome, the polypeptide that was being formed disengages from the
ribosome, and the last preceding amino acid residue becomes the carboxy terminal end of the polypeptide.

The region of mRNA which is located on the 3' side of a stop codon in a monocistronic gene is referred
to herein as 3' non-translated region 18. This region 18 is believed to be involved in the processing,
stability, and/or transport of the mRNA after it is transcribed. This region 18 is also believed to contain a
sequence of bases, poly-adenylation signal 20, which is recognized by an enzyme in the cell. This enzyme
adds a substantial number of adenosine residues to the mRNA molecule, to form poly-A tail 22.

The DNA molecule has a 3' non-translated region 24 and a poly-adenylation signal 26, which code for
the corresponding mRNA region 18 and signal 20. However, the DNA molecule does not have a poly-A tail.
Poly-adenylation signals 20 (mRNA) and 26 (DNA) are represented in the figures by a heavy dot.


Gene-Host Incompatibility 

The same genetic code is utilized by all living organisms on Earth. Plants, animals, and microorganisms
all utilize the same correspondence between codons and amino acids. However, the genetic code applies
only to the structural sequence of a gene, i.e., the segment of mRNA bounded by one start codon and one
stop codon which codes for the translation of mRNA into polypeptides.

However, a gene which performs efficiently in one type of cell may not perform at all in a different type
of cell. For example, a gene which is expressed in E. coli may be transferred into a different type of
bacterial cell, a fungus, or a yeast. However, the gene might not be expressed in the new host cell. There
are numerous reasons why an intact gene which is expressed in one type of cell might not be expressed in
a different type of cell. See, e.g., Sakaguchi and Okanishi, 1981. Such reasons include:

1. the gene might not be replicated or stably inherited by the progeny of the new host cell.

2. the gene might be broken apart by restriction endonucleases or other enzymes in the new host cell.

3. the promoter region of the gene might not be recognized by the RNA polymerases in the new host
cell.

4. one or more regions of the gene might be bound by a repressor protein or other molecule in the new
host cell, because of a DNA region which resembles an operator or other regulatory sequence of the host's
DNA. For example, the lac operon includes a polypeptide which binds to a particular sequence of bases
next to the lac promoter unless the polypeptide is itself inactivated by lactose. See, e.g., Miller and
Reznikoff, 1982.

5. one or more regions of the gene might be deleted, reorganized, or relocated to a different part of the
host's genome. For example, numerous procaryotic cells are known to contain enzymes which promote
genetic recombination (such as the rec proteins in E. coli; see, e.g., Shibata et al, 1979) and transposition
(see, e.g., The 45th Cold Spring Harbor Symposium on Quantitative Biology, 1981). In addition,
naturally-occurring genetic modification can be enhanced by regions of homology between different strands
of DNA; see, e.g., Radding, 1978.

6. mRNA transcribed from the gene may suffer from a variety of problems. For example, it might be
degraded before it reaches the ribosome, or it might not be poly-adenylated or transported to the ribosome,
or it might not interact properly with the ribosome, or it might contain an essential sequence which is
deleted by RNA processing enzymes.

7. the polypeptide which is created by translation of the mRNA coded for by the gene may suffer from
a variety of problems. For example, the polypeptide may have a toxic effect on the cell, or it may be
glycosylated or converted into an altered polypeptide, or it may be cleaved into shorter polypeptides or
amino acids, or it may be sequestered within an intracellular compartment where it is not functional.

In general, the likelihood of a foreign gene being expressed in a cell tends to be lower if the new host
cell is substantially different from the natural host cell. For example, a gene from a certain species of
bacteria is likely to be expressed by other species of bacteria within the same genus. The gene is less
likely to be expressed by bacteria of a different genus, and even less likely to be expressed by
non-bacterial microorganisms such as yeast, fungus, or algae. It is very unlikely that a gene from a cell of one
kingdom (the three kingdoms are plants, animals, and "protista" (microorganisms)) could be expressed in
cells from either other kingdom.

These and other problems have, until now, thwarted efforts to obtain expression of foreign genes in
plant cells. For example, several research teams have reported the insertion of foreign DNA into plant cells;
see, e.g., Lurquin, 1979; Krens et al, 1982; Davey et al, 1980. At least three teams of researchers have
reported the insertion of entire genes into plant cells. By use of radioactive DNA probes, these researchers
have reported that the foreign genes (or at least portions thereof) were stably inherited by the descendants
of the plant cells. See Hernalsteens et al, 1980; Garfinkel et al, 1981; Matzke and Chilton, 1981. However,
there was no reported evidence that the foreign genes were expressed in the plant cells.

Several natural exceptions to the gene-host incompatibility barriers have been discovered. For example,
several E. coli genes can be expressed in certain types of yeast cells, and vice-versa. See Beggs, 1978;
Struhl et al, 1979.

In addition, certain types of bacterial cells, including Agrobacterium tumefaciens and A. rhizogenes, are
capable of infecting various types of plant cells, causing plant diseases such as crown gall tumor and hairy
root disease. These Agrobacterium cells carry plasmids, designated as Ti plasmids and Ri plasmids, which
carry genes which are expressed in plant cells. Certain of these genes code for enzymes which create
substances called "opines," such as octopine, nopaline, and agropine. Opines are utilized by the bacterial
cells as sources of carbon, nitrogen, and energy. See, e.g., Petit and Tempe, 1978. The opine genes are
believed to be inactive while in the bacterial cells; these genes are expressed only after they enter the plant
cells.

In addition, a variety of man-made efforts have been reported to overcome one or more of the
gene-host incompatibility barriers. For example, it has been reported that a mammalian polypeptide which is
normally degraded within a bacterial host can be protected from degradation by coupling the mammalian
polypeptide to a bacterial polypeptide that normally exists in the host cell. This creates a "fusion protein;"
see, e.g., Itakura et al, 1977. As another example, in order to avoid cleavage of an inserted gene by
endonucleases in the host cell, it is possible to either (1) insert the gene into host cells which are deficient
in one or more endonucleases, or (2) duplicate the gene in cells which cause the gene to be methylated.
See, e.g., Maniatis et al, 1981.

In addition, various efforts to overcome gene-host incompatibility barriers involve chimeric genes. For
example, a structural sequence which codes for a mammalian polypeptide, such as insulin, interferon, or
growth hormone, may be coupled to regulatory sequences from a bacterial gene. The resulting chimeric
gene may be inserted into bacterial cells, where it will express the mammalian polypeptide. See, e.g.,
Guarente et al, 1980. Alternately, structural sequences from several bacterial genes have been coupled to
regulatory sequences from viruses which are capable of infecting mammalian cells. The resulting chimeric
genes were inserted into mammalian cells, where they reportedly expressed the bacterial polypeptide. See,
e.g., Southern and Berg, 1982; Colbere-Garapin et al, 1982.

Restriction Endonucleases

In general, an endonuclease is an enzyme which is capable of breaking DNA into segments of DNA. An
endonuclease is capable of attaching to a strand of DNA somewhere in the middle of the strand, and
breaking it. By comparison, an exonuclease removes nucleotides from the end of a strand of DNA. All of
the endonucleases discussed herein are capable of breaking double-stranded DNA into segments. This may
require the breakage of two types of bonds: (1) covalent bonds between phosphate groups and deoxyribose
residues, and (2) hydrogen bonds (A-T and C-G) which hold the two strands of DNA to each other.

A "restriction endonuclease" (hereafter referred to as an endonuclease) breaks a segment of DNA at a
precise sequence of bases. For example, EcoRI and HaeIII recognize and cleave the following sequences:

    EcoRI:     5'-XXGAATTCXX-3'          5'-XXG       AATTCXX-3'
               3'-YYCTTAAGYY-5'   --->   3'-YYCTTAA       GYY-5'

    HaeIII:    5'-XXGGCCXX-3'            5'-XXGG   CCXX-3'
               3'-YYCCGGYY-5'     --->   3'-YYCC   GGYY-5'


In the examples cited above, the EcoRI cleavage created a "cohesive" end with a 5' overhang (i.e., the
single-stranded "tail" has a 5' end rather than a 3' end). Cohesive ends can be useful in promoting desired
ligations. For example, an EcoRI end is more likely to anneal to another EcoRI end than to a HaeIII end.
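
As an illustration only (not part of the original disclosure), the site-specific cleavage described above
can be sketched by scanning the top strand for a recognition site and cutting at a fixed offset. The
dictionary `ENZYMES`, the function `digest`, and the test sequence are hypothetical names and data.

```python
# Each enzyme: recognition site (top strand, 5'->3') and cut offset within the site.
ENZYMES = {
    "EcoRI": ("GAATTC", 1),  # G^AATTC, leaves a cohesive end with a 5' AATT overhang
    "HaeIII": ("GGCC", 2),   # GG^CC, cuts both strands at the same position
}


def digest(top_strand, enzyme):
    """Cut the top strand at every occurrence of the enzyme's site; return fragments."""
    site, offset = ENZYMES[enzyme]
    fragments = []
    start = 0
    pos = top_strand.find(site)
    while pos != -1:
        cut = pos + offset
        fragments.append(top_strand[start:cut])
        start = cut
        pos = top_strand.find(site, pos + 1)
    fragments.append(top_strand[start:])
    return fragments


dna = "TTGAATTCAAGGCCTT"  # hypothetical sequence with one site for each enzyme
print(digest(dna, "EcoRI"))   # ['TTG', 'AATTCAAGGCCTT']
print(digest(dna, "HaeIII"))  # ['TTGAATTCAAGG', 'CCTT']
```

Only the top strand is modeled; because both sites are palindromic, the bottom-strand cut follows by
symmetry.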

Over 100 different endonucleases are known, each of which is capable of cleaving DNA at specific
sequences. See, e.g., Roberts, 1982. All restriction endonucleases are sensitive to the sequence of bases.
In addition, some endonucleases are sensitive to whether certain bases have been methylated. For
example, two endonucleases, MboI and Sau3a, are capable of cleaving the following sequence of bases as
shown:

        5'-XX       GATCXX-3'
        3'-YYCTAG       YY-5'

MboI cannot cleave this sequence if the adenine residue is methylated (me-A). Sau3a can cleave this
sequence, regardless of whether either A is methylated. To some extent, the methylation (and therefore the
cleavage) of a plasmid may be controlled by replicating the plasmids in cells with desired methylation
capabilities. An E. coli enzyme, DNA adenine methylase (dam), methylates the A residues that occur in
GATC sequences. Strains of E. coli which do not contain the dam enzyme are designated as dam- cells.
Cells which contain dam are designated as dam+ or dam cells.

Several endonucleases are known which cleave different sequences, but which create cohesive ends
which are fully compatible with cohesive ends created by other endonucleases. For example, at least five
different endonucleases create 5' GATC overhangs, as shown in Table 1.



Table 1

    Endonuclease    Sequence (5' to 3')    Effect of me-A
    MboI            ^GATC                  Inhibited by me-A
    Sau3a           ^GATC                  Unaffected by me-A
    BglII           A^GATCT                Unaffected by me-A
    BclI            T^GATCA                Inhibited by me-A
    BamHI           G^GATCC                Unaffected by me-A

    (^ marks the cleavage site on the strand shown.)


A cohesive end created by any of the endonucleases listed in Table 1 will ligate preferentially to a
cohesive end created by any of the other endonucleases. However, a ligation of, for example, a BglII end
with a BamHI end will create the following sequence:

        AGATCC
        TCTAGG

This sequence cannot be cleaved by either BglII or BamHI; however, it can be cleaved by MboI
(unless methylated) or by Sau3a.
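
The following sketch, offered only as an illustration under stated assumptions, checks which enzymes can
still cleave the hybrid junction formed by a BglII end and a BamHI end. The names `SITES`,
`INHIBITED_BY_ME_A`, and `can_cleave` are hypothetical, and the `a_methylated` flag is a simplification
that treats every adenine in the site as methylated.

```python
# Recognition sites (top strand, 5'->3'); all are palindromic, so one strand suffices.
SITES = {
    "MboI": "GATC",
    "Sau3a": "GATC",
    "BglII": "AGATCT",
    "BclI": "TGATCA",
    "BamHI": "GGATCC",
}
INHIBITED_BY_ME_A = {"MboI", "BclI"}  # enzymes blocked by adenine methylation


def can_cleave(enzyme, top_strand, a_methylated=False):
    """True if the enzyme's site is present and methylation does not block it."""
    if a_methylated and enzyme in INHIBITED_BY_ME_A:
        return False
    return SITES[enzyme] in top_strand


# Hypothetical fragments: the left arm ends at a BglII cut (A^GATCT),
# the right arm begins at a BamHI cut (G^GATCC).
left_arm = "CCA"
right_arm = "GATCCTT"
hybrid = left_arm + right_arm  # "CCAGATCCTT", which contains AGATCC

for name in ("BglII", "BamHI", "MboI", "Sau3a"):
    print(name, can_cleave(name, hybrid), can_cleave(name, hybrid, a_methylated=True))
# BglII False False
# BamHI False False
# MboI True False
# Sau3a True True
```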

Another endonuclease which involves the GATC sequence is PvuI, which creates a 3' overhang, as
follows:

        5'-XXCGAT     CGXX-3'
        3'-YYGC     TAGCYY-5'


Another endonuclease, ClaI, cleaves the following sequence:

        5'-X1-AT^CGAT-X2-3'
        3'-Y1-TAGC^TA-Y2-5'

If X1 is G, or if X2 is C, then the sequence may be cleaved by MboI (unless methylated, in which case
ClaI is also inhibited) or Sau3a.



Viral Promoters 

A virus is a microorganism comprising single or double stranded nucleic acid (DNA or RNA) contained
within a protein (and possibly lipid) shell called a "capsid" or "coat". A virus is smaller than a cell, and it
does not contain most of the components and substances necessary to conduct most biochemical
processes. Instead, a virus infects a cell and uses the cellular processes to reproduce itself.

The following is a simplified description of how a DNA-containing virus infects a cell; RNA viruses will
be disregarded in this introduction for the sake of clarity. First, a virus attaches to or enters a cell, normally
called a "host" cell. The DNA from the virus (and possibly the entire viral particle) enters the host cell
where it usually operates as a plasmid (a loop of extra-chromosomal DNA). The viral DNA is transcribed
into messenger RNA, which is translated into one or more polypeptides. Some of these polypeptides are
assembled into new capsids, while others act as enzymes to catalyze various biochemical reactions. The
viral DNA is also replicated and assembled with the capsid polypeptides to form new viral particles. These
viral particles may be released gradually by the host cell, or they may cause the host cell to lyse and
release them. The released viral particles subsequently infect new host cells. For more background
information on viruses see, e.g., Stryer, 1981 and Matthews, 1970.

As used herein, the term "virus" includes phages and viroids, as well as replicative intermediates. As
used herein, the phrases "viral nucleic acid" and "DNA or RNA derived from a virus" are construed broadly
to include any DNA or RNA that is obtained or derived from the nucleic acid of a virus. For example, a DNA
strand created by using a viral RNA strand as a template, or by chemical synthesis to create a known
sequence of bases determined by analyzing viral DNA, would be regarded as viral nucleic acid.

The host range of any virus (i.e., the variety of cells that a type of virus is capable of infecting) is
limited. Some viruses are capable of efficient infection of only certain types of bacteria; other viruses can
infect only plants, and may be limited to certain genera; some viruses can infect only mammalian cells.
Viral infection of a cell requires more than mere entry of the viral DNA or RNA into the host cell; viral
particles must be reproduced within the cell. Through various assays, those skilled in the art can readily
determine whether any particular type of virus is capable of infecting any particular genus, species, or strain
of cells. As used herein, the term "plant virus" is used to designate a virus which is capable of infecting one
or more types of plant cells, regardless of whether it can infect other types of cells.

With the possible exception of viroids (which are poorly understood at present), every viral particle must 
contain at least one gene which can be "expressed" in infected host cells. The expression of a gene 
requires that a segment of DNA or RNA must be transcribed into or function as a strand of messenger RNA 
(mRNA), and the mRNA must be translated into a polypeptide. Most viruses have about 5 to 10 different 
genes, all of which are expressed in a suitable host cell. 

Promoters from viral genes have been utilized in a variety of genetic engineering applications. For
example, chimeric genes have been constructed using various structural sequences (also called coding
sequences) taken from bacterial genes, coupled to promoters taken from viruses which can infect
mammalian cells (the most commonly used mammalian viruses are designated as Simian Virus 40 (SV40)
and Herpes Simplex Virus (HSV)). These chimeric genes have been used to transform mammalian cells.
See, e.g., Mulligan et al, 1979; Southern and Berg, 1982. In addition, chimeric genes using promoters taken
from viruses which can infect bacterial cells have been used to transform bacterial cells; see, e.g., the
phage lambda PL promoter discussed in Maniatis et al, 1982.

Several researchers have theorized that it might be possible to utilize plant viruses as vectors for
transforming plant cells. See, e.g., Hohn et al, 1982. In general, a "vector" is a DNA molecule useful for
transferring one or more genes into a cell. Usually, a desired gene is inserted into a vector, and the vector
is then used to infect the host cell.

Several researchers have theorized that it might be possible to create chimeric genes which are
capable of being expressed in plant cells, by using promoters derived from plant virus genes. See, e.g.,
Hohn et al, 1982, at page 216.

However, despite the efforts of numerous research teams, prior to this invention no one had succeeded
in (1) creating a chimeric gene comprising a plant virus promoter coupled to a heterologous structural
sequence and (2) demonstrating the expression of such a gene in any type of plant cell.


Cauliflower Mosaic Virus (CaMV) 

The entire DNA sequence of CaMV has been published; see Gardner et al, 1981; Hohn et al, 1982. In its
most common form, the CaMV genome is about 8000 bp long. However, various naturally occurring
infective mutants which have deleted about 500 bp have been discovered; see Howarth et al, 1981. The
entire CaMV genome is transcribed into a single mRNA, with a sedimentation coefficient of 32S. The
promoter for the 32S mRNA is located in the large intergenic region about 1 kb counterclockwise from Gap
1 (see Guilley et al, 1982).

CaMV is believed to generate at least eight proteins; the corresponding genes are designated as Genes
I through VIII. Gene VI is transcribed into mRNA with a sedimentation coefficient of 19S. The 19S mRNA is
translated into a protein designated as P66, which is an inclusion body protein. The 19S mRNA is
promoted by the 19S promoter, located about 2.5 kb counterclockwise of Gap 1.



SUMMARY OF THE INVENTION

This invention relates to chimeric genes which are capable of being expressed in plant cells, and to a 
method for creating such genes. 

The chimeric gene comprises a promoter region which is capable of causing RNA polymerase in a
plant cell to create messenger RNA corresponding to the DNA. One such promoter region comprises a
nopaline synthase (NOS) promoter region, which normally exists in certain types of Ti plasmids in the
bacterium A. tumefaciens. The NOS gene normally is inactive while contained in A. tumefaciens cells, and it
becomes active after the Ti plasmid enters a plant cell. Two other promoters have been derived from the
cauliflower mosaic virus (CaMV). Other suitable promoter regions may be derived from genes which exist
naturally in plant cells, or from viruses which are capable of infecting plant cells.

The chimeric gene also contains a sequence of bases which codes for a 5' non-translated region of
mRNA which is capable of enabling or increasing the expression in a plant cell of a structural sequence of
the mRNA. For example, a suitable 5' non-translated region may be taken from the NOS gene, from a plant
virus gene, or from a gene which exists naturally in plant cells.

The chimeric gene also contains a desired structural sequence, i.e., a sequence which is transcribed
into mRNA which is capable of being translated into a desired polypeptide. The structural sequence is
heterologous with respect to the promoter region, and it may code for any desired polypeptide, such as a
bacterial or mammalian protein. The structural sequence includes a start codon and a stop codon. The
structural sequence may contain introns which are removed from the mRNA prior to translation.

The chimeric gene also contains a DNA sequence which codes for a 3' non-translated region (including
a poly-adenylation signal) of mRNA. This region may be derived from a gene which is naturally expressed
in plant cells, to help ensure proper expression of the structural sequence. Such genes include the NOS
gene, plant virus genes, and genes which exist naturally in plant cells.
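
The four elements listed above can be pictured as an ordered record. The sketch below is only an
illustration and not the method of this invention: the class name `ChimericGene`, its field names, and the
short placeholder sequences are hypothetical, and the validation ignores introns for simplicity.

```python
from dataclasses import dataclass

STOP_CODONS = ("TAA", "TAG", "TGA")  # DNA equivalents of UAA, UAG, UGA


@dataclass
class ChimericGene:
    promoter: str         # promoter region (e.g. derived from NOS or CaMV)
    five_prime_utr: str   # codes for the 5' non-translated region
    structural: str       # heterologous coding sequence, start codon to stop codon
    three_prime_utr: str  # 3' non-translated region with poly-adenylation signal

    def validate(self):
        """Check that the structural sequence has a start and a stop codon."""
        if not self.structural.startswith("ATG"):
            raise ValueError("structural sequence must begin with a start codon")
        if self.structural[-3:] not in STOP_CODONS:
            raise ValueError("structural sequence must end with a stop codon")

    def assemble(self):
        """Join the elements in 5' to 3' order."""
        self.validate()
        return (self.promoter + self.five_prime_utr
                + self.structural + self.three_prime_utr)


# Placeholder sequences, far shorter than any real gene element.
gene = ChimericGene(
    promoter="TATAAA",
    five_prime_utr="GCACC",
    structural="ATGCTATAA",
    three_prime_utr="AATAAA",
)
print(gene.assemble())  # TATAAAGCACCATGCTATAAAATAAA
```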

The method of this invention is described below, and is summarized in the flow chart of Figure 2. 

If properly assembled and inserted into a plant genome, a chimeric gene of this invention will be
expressed in the plant cell to create a desired polypeptide, such as a mammalian hormone, or a bacterial
enzyme which confers antibiotic or herbicide resistance upon the plant.



Brief Description of the Drawings

The figures herein are schematic representations; they have not been drawn to scale.

Figure 1 represents the structure of a typical eukaryotic gene.

Figure 2 is a flow chart representing the steps of this invention, correlated with an example chimeric
NOS-NPTII-NOS gene.

Figure 3 represents fragment HindIII-23, obtained by digesting a Ti plasmid with HindIII.

Figure 4 represents a DNA fragment which contains a NOS promoter region, a NOS 5' non-translated
region, and the first few codons of the NOS structural sequence.

Figure 5 represents the cleavage of a DNA sequence at a precise location, to obtain a DNA fragment
which contains a NOS promoter region and complete 5' non-translated region.

Figure 6 represents the creation of plasmids pMON1001 and pMON40, which contain an NPTII
structural sequence.

Figure 7 represents the insertion of a NOS promoter region into plasmid pMON40, to obtain pMON58.

Figure 8 represents the creation of an M13 derivative designated as M-2, which contains a NOS 3'
non-translated region and poly-A signal.

Figure 9 represents the assembly of the NOS-NPTII-NOS chimeric gene, and the insertion of the
chimeric gene into plasmid pMON38 to obtain plasmids pMON75 and pMON78.

Figure 10 represents the insertion of the NOS-NPTII-NOS chimeric gene into plasmid pMON120 to
obtain plasmids pMON128 and pMON129.

Figure 11 represents the creation of plasmid pMON66, which contains an NPTI gene.

Figure 12 represents the creation of plasmid pMON73, containing a chimeric NOS-NPTII sequence.

Figure 13 represents the creation of plasmid pMON78, containing a chimeric NOS-NPTI sequence.

Figure 14 represents the creation of plasmids pMON106 and pMON107, which contain chimeric
NOS-NPTI-NOS genes.

Figure 15 represents the insertion of a chimeric NOS-NPTI-NOS gene into pMON120 to obtain
plasmids pMON130 and pMON131.

Figure 16 represents the structure of a DNA fragment containing a soybean protein (sbss) promoter.

Figure 17 represents the creation of plasmid pMON121, containing the sbss promoter.

Figure 18 represents the insertion of a chimeric sbss-NPTII-NOS gene into pMON120 to create
plasmids pMON141 and pMON142.

Figure 19 represents the creation of plasmid pMON108, containing a bovine growth hormone structural
sequence and a NOS 3' region.

Figure 20 represents the creation of plasmid N25-BGH, which contains the BGH-NOS sequence
surrounded by selected cleavage sites.

Figure 21 represents the insertion of a chimeric sbss-BGH-NOS gene into pMON120 to obtain plasmids
pMON147 and pMON148.

Figure 22 represents the creation of plasmid pMON149, which contains a chimeric NOS-BGH-NOS
gene.

Figure 23 represents the creation of plasmid pMON8, which contains a structural sequence for EPSP
synthase.

Figure 24 represents the creation of plasmid pMON25, which contains an EPSP synthase structural
sequence with several cleavage sites near the start codon.

Figure 25 represents the creation of plasmid pMON146, which contains a chimeric sequence
comprising EPSP synthase and a NOS 3' region.

Figure 26 represents the insertion of a chimeric NOS-EPSP-NOS gene into pMON120 to obtain plasmid
pMON153.

Figure 27 represents the creation of plasmid pMON154, which contains a chimeric sbss-EPSP-NOS
gene.

Figure 28 represents the creation and structure of plasmid pM0N93, which contains a CaMV 19S 
promoter. 

Figure 29 represents the creation and structure of plasmid pMON156, which contains a chimeric
CaMV(19S)-NPT-NOS gene.

Figure 30 represents the creation and structure of plasmid pMON110, which contains a partial NPT
gene.

Figure 31 represents the creation and structure of plasmid pMON132, which contains a partial NPT-NOS
gene.

Figure 32 represents the creation and structure of plasmid pMON155, which contains a chimeric
CaMV(19S)-NPT-NOS gene.

Figure 33 represents the creation and structure of plasmid pMON81, which contains a CaMV 32S
promoter.

Figure 34 represents the creation and structure of plasmid pMON125, which contains a CaMV 32S
promoter.

Figure 35 represents the creation and structure of plasmid pMON172, which contains a CaMV 32S
promoter.

Figure 36 represents the creation and structure of phage M12, which contains a CaMV 32S promoter.

Figure 37 represents the creation and structure of plasmids pMON183 and pMON184, which contain
chimeric CaMV(32S)-NPT-NOS genes.



DETAILED DESCRIPTION OF THE INVENTION 


In one preferred embodiment of this invention, a chimeric gene was created which contained the 
following elements: 

1. a promoter region and a 5' non-translated region derived from a nopaline synthase (NOS) gene;

2. a structural sequence derived from a neomycin phosphotransferase II (NPTII or NPT II) gene; and,

3. a 3' non-translated region, including a poly-adenylation signal, derived from a NOS gene.

This chimeric gene, referred to herein as a NOS-NPTII-NOS gene, was assembled and inserted into a
variety of plant cells, causing them to become resistant to aminoglycoside antibiotics such as kanamycin.

The method used to assemble this chimeric gene is summarized in the flow chart of Figure 2 and
described in detail below and in the examples. To assist the reader in understanding the steps of this
method, various plasmids and fragments involved in the NOS-NPTII-NOS chimeric gene are cited in
parentheses in Figure 2. However, the method of Figure 2 is applicable to a wide variety of other plasmids
and fragments. To further assist the reader, the steps shown in Figure 2 have been assigned callout
numbers 42 et seq. These callout numbers are cited in the following description. The techniques and DNA
sequences of this invention are likely to be useful in the transformation of a wide variety of plants, including
any plant which may be infected by one or more strains of A. tumefaciens or A. rhizogenes.



The NOS Promoter Region and 5' Non-translated Region

The Applicants decided to obtain and utilize a nopaline synthase (NOS) promoter region to control the
expression of the heterologous gene. The NOS gene is normally carried in certain types of Ti plasmids, such as
pTiT37 (Sciaky et al, 1978). The NOS promoter is normally inactive while in an A. tumefaciens cell. The
entire NOS gene, including the promoter and the protein coding sequence, is within the T-DNA portion of a
Ti plasmid that is inserted into the chromosomes of plant cells when a plant becomes infected and forms a
crown gall tumor. Once inside the plant cell, the NOS promoter region directs RNA polymerase to
transcribe the NOS protein coding sequence into mRNA, which is subsequently translated into
the NOS enzyme.

The boundaries between the different parts of a promoter region (shown in Figure 1 as association
region 2, intervening region 4, transcription initiation sequence 6, and intervening region 8), and the
boundary between the promoter region and the 5' non-translated region, are not fully understood. The
Applicants decided to utilize the entire promoter region and 5' non-translated region from the NOS gene,
which is known to be expressed in plant cells. However, it is entirely possible that one or more of these
sequences might be modified in various ways, such as alteration in length or replacement by other
sequences. Such modifications in promoter regions and 5' non-translated regions have been studied in
bacterial cells (see, e.g., Roberts et al, 1979) and mammalian cells (see, e.g., McKnight, 1982). By utilizing
the methodology taught by this invention, it is now possible to study the effects of modifications to promoter
regions and 5' non-translated regions on the expression of genes in plant cells. It may be possible to
increase the expression of a gene in a plant cell by means of such modifications. Such modifications, if
performed upon chimeric genes of this invention, are within the scope of this invention.

A nopaline-type tumor-inducing plasmid, designated as pTiT37, was isolated from a strain of A.
tumefaciens using standard procedures (Currier and Nester, 1976). It was digested with the endonuclease
HindIII, which produced numerous fragments. These fragments were separated by size on a gel, and one of
the fragments was isolated and removed from the gel. This fragment was designated as the HindIII-23
fragment because it was approximately the 23rd largest fragment from the Ti plasmid; it is approximately
3400 base pairs (bp) in size, also referred to as 3.4 kilobases (kb). From work by others (see, e.g.,
Hernalsteens et al, 1980), it was known that the HindIII-23 fragment contained the entire NOS gene,
including the promoter region, a 5' non-translated region, a structural sequence with a start codon and a
stop codon, and a 3' non-translated region. The HindIII-23 fragment is shown in Figure 3.

By means of various cleavage and sequencing experiments, it was determined that the HindIII-23
fragment could be digested by another endonuclease, Sau3a, to yield a fragment, about 350 bp in size,
which contains the entire NOS promoter region, the 5' non-translated region, and the first few codons of the
NOS structural sequence. This fragment was sequenced, and the base sequence is represented in Figure 4.
The start codon (ATG) of the NOS structural sequence begins at base pair 301 within the 350 bp fragment.
The Applicants decided to cleave the fragment between base pairs 300 and 301; this would provide them
with a fragment about 300 base pairs long containing a NOS promoter region and the entire 5'
non-translated region but with no translated bases. To cleave the 350 bp fragment at precisely the right location,
the Applicants obtained an M13 clone designated as SIA, and utilized the procedure described below.

To create the SIA clone, Dr. Michael Bevan of Washington University converted the 350 bp Sau3a
fragment into a single strand of DNA. This was done by utilizing a virus vector, designated as the M13 mp2
phage, which goes through both double-stranded (ds) and single-stranded (ss) stages in its life cycle
(Messing et al, 1981). The ds 350 bp fragment was inserted into the double-stranded replicative form DNA
of the M13 mp2, which had been cleaved with BamHI. The two fragments were ligated, and used to infect
E. coli cells. The ds DNA containing the 350 bp inserted fragment subsequently replicated, and one strand
(the viral strand) was encapsulated by the M13 viral capsid proteins. In one clone, designated the SIA, the
orientation of the 350 bp fragment was such that the anti-sense strand (containing the same sequence as
the mRNA) of the NOS gene was carried in the viral strand. Viral particles released from infected cells were
isolated, and provided to the Applicants.

Single-stranded SIA DNA, containing the anti-sense 350 bp fragment with the NOS promoter region,
was isolated from the viral particles and sequenced. A 14-mer oligonucleotide primer was synthesized,
using published procedures (Beaucage and Carruthers, 1981, as modified by Adams et al, 1982). This
14-mer was designed to be complementary to bases 287 through 300 of the 350 bp fragment, as shown on
Figure 4.
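
The primer design step can be illustrated with a minimal Python sketch. It assumes a template written 5' to 3' and returns the oligonucleotide complementary to a chosen range of bases; the function names and the 20-base template are hypothetical and are not the NOS sequence of Figure 4. The only figures taken from the text are the coordinates 287 through 300, which span 14 bases.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]

def design_primer(template: str, first: int, last: int) -> str:
    """Primer (5'->3') complementary to bases first..last (1-based, inclusive)."""
    if first not in range(1, last + 1) or last > len(template):
        raise ValueError("primer range lies outside the template")
    return reverse_complement(template[first - 1:last])

# Length of a primer covering bases 287 through 300, as described in the text.
primer_length = 300 - 287 + 1

# Hypothetical 20-base template used only to exercise the function.
template = "GATTACAGGCTTAACTGCAA"
primer = design_primer(template, 7, 20)
print(primer_length, len(primer), primer)
```

Because the primer is the reverse complement of its target, its 5' end pairs with the last base of the range; in the procedure described here, that is why the labelled 5' end of the primer lies opposite base 300.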

The 5' end of the synthetic primer was radioactively labelled with 32P; this is represented in the figures
by an asterisk.

Copies of the primer were mixed with copies of the single-stranded SIA DNA containing the anti-sense
strand of the 350 bp fragment. The primer annealed to the desired region of the SIA DNA, as shown at the
top of Figure 5. After this occurred, Klenow DNA polymerase and a controlled quantity of unlabelled
deoxynucleoside triphosphates (dNTP's), A, T, C, and G, were added. Klenow polymerase added nucleotides to
the 3' (unlabelled) end of the primer, but not to the 5' (labelled) end. The result, as shown in Figure 5, was a
circular loop of single-stranded DNA, part of which was matched by a second strand of DNA. The 5' end of
the second strand was located opposite base #300 of the Sau3a insert.

The partially double-stranded DNA was then digested by a third endonuclease, HaeIII, which can cleave
both single-stranded and double-stranded DNA. HaeIII cleavage sites were known to exist in several
locations outside the 350 bp insert, but none existed inside the 350 bp insert. This created a fragment
having one blunt end, and one 3' overhang which started at base #301 of the Sau3a insert.

The HaeIII fragment mixture was treated with T4 DNA polymerase and unlabelled dNTP's. This caused
the single-stranded portion of the DNA, which extended from base #301 of the Sau3a insert to the closest
HaeIII cleavage site, to be removed from the fragment. In this manner, the ATG start codon was removed
from base pair #300, leaving a blunt-ended double-stranded fragment which was approximately 550 bp long.

The mixture was then digested by a fourth endonuclease, EcoRI, which cleaved the 550 bp fragment at
a single site outside the NOS promoter region. The fragments were then separated by size on a gel, and
the radioactively-labelled fragment was isolated. This fragment contained the entire NOS promoter region
and 5' non-translated region. It had one blunt end with a sequence of

    5'-...CTGCA
     * ...GACGT


and one cohesive end (at the EcoRI site) with a sequence of 

    5'-AATTC-
           G-



The shorter strand was about 308 bp long. 

The foregoing steps are represented in Figure 2 as steps 42, 44, and 46.

This fragment was inserted into pMON40 (which is described below) to obtain pMON58, as shown on
Figure 7.



Creation of plasmid with NPT II gene (pMON40)

A bacterial transposon, designated as Tn5, is known to contain a complete NPT II gene, including the
promoter region, structural sequence, and 3' non-translated region. The NPT II enzyme inactivates certain
aminoglycoside antibiotics, such as kanamycin, neomycin, and G418; see Jimenez and Davies, 1980. This
gene is contained within a 1.8 kb fragment which can be obtained by digesting phage lambda bbkan-1
DNA (D. Berg et al, 1975) with two endonucleases, HindIII and BamHI. This fragment was inserted into a
common laboratory plasmid, pBR327, which had been digested by HindIII and BamHI. As shown in Figure
6, the resulting plasmid was designated as pMON1001, which was about 4.7 kb.

To reduce the size of the DNA fragment which carried the NPT II structural sequence, the Applicants
eliminated about 500 bp from the pMON1001 plasmid, in the following manner. First they digested
pMON1001 at a unique SmaI restriction site which was outside of the NPT II gene. Next they inserted a
10-mer synthetic oligonucleotide linker,

    5'-CCGGATCCGG
       GGCCTAGGCC

into the SmaI cleavage site. This eliminated the SmaI cleavage site and replaced it with a BamHI cleavage
site. A second BamHI cleavage site already existed, about 500 bp from the new BamHI site. The Applicants
digested the plasmid with BamHI, separated the 500 bp fragment from the 4.2 kb fragment, and circularized
the 4.2 kb fragment. The resulting plasmids were inserted into E. coli, which were then selected for
resistance to ampicillin and kanamycin. A clonal colony of E. coli was selected; these cells contained a
plasmid which was designated as pMON40, as shown in Figure 6.

The foregoing steps are represented in Figure 2 as steps 48 and 50.
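
The properties of the linker can be checked with a short Python sketch. It confirms that the 10-mer is self-complementary, that it carries a BamHI recognition sequence (GGATCC), and that inserting it into a blunt SmaI cut (CCC^GGG) no longer leaves a SmaI sequence at the junction. The variable names are hypothetical, and the junction is modelled using only the two halves of the SmaI site; the real flanking sequence of pMON1001 is not given in the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

LINKER = "CCGGATCCGG"    # 10-mer linker from the text
BAMHI_SITE = "GGATCC"
SMAI_SITE = "CCCGGG"     # SmaI cuts CCC^GGG, leaving blunt ends

# A self-complementary linker reads the same on both strands.
is_palindromic = LINKER == reverse_complement(LINKER)
has_bamhi = BAMHI_SITE in LINKER

# Model only the junction: left half of the SmaI site, linker, right half.
junction = "CCC" + LINKER + "GGG"
smai_destroyed = SMAI_SITE not in junction

# Approximate sizes quoted in the text, in base pairs.
circularized_bp = 4700 - 500

print(is_palindromic, has_bamhi, smai_destroyed, circularized_bp)
```

The size arithmetic matches the text: removing the roughly 500 bp BamHI fragment from the roughly 4.7 kb plasmid leaves the 4.2 kb fragment that was circularized.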



Insertion of NOS promoter into plasmid pMON40 

The Applicants deleted the NPT II promoter from pMON40, and replaced it with the NOS promoter
fragment described previously, by the following method, shown on Figure 7.

Previous cleavage and sequencing experiments (Rao and Rogers, 1979; Auerswald et al, 1980)
indicated that a BglII cleavage site existed in the NPT II gene between the promoter region and the
structural sequence. Plasmid pMON40 was digested with BglII. The cohesive ends were then filled in by
mixing the cleaved plasmid with Klenow polymerase and the four dNTP's, to obtain the following blunt
ends:

    5'-AGATC      GATCT-
      -TCTAG      CTAGA-5'
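
The blunt ends shown above follow directly from the BglII cut position (A^GATCT) and the fill-in reaction. The Python sketch below is a simplified model, assuming a palindromic site that leaves 5' overhangs; the function name is hypothetical. Top strands are written 5' to 3' and bottom strands 3' to 5', so each base sits under its partner.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def fill_in_ends(site: str, cut_offset: int) -> dict:
    """Cut a palindromic site that leaves 5' overhangs, then fill in both ends.

    `cut_offset` is the number of bases before the cut on the top strand.
    """
    if cut_offset * 2 > len(site):
        raise ValueError("this sketch only handles 5' overhangs")
    bottom = site.translate(COMPLEMENT)    # complement, written 3'->5'
    bottom_cut = len(site) - cut_offset    # cut position on the bottom strand
    return {
        # Left piece: the top strand is extended to match the bottom strand.
        "left_top": site[:bottom_cut],
        "left_bottom": bottom[:bottom_cut],
        # Right piece: the bottom strand is extended to match the top strand.
        "right_top": site[cut_offset:],
        "right_bottom": bottom[cut_offset:],
    }

ends = fill_in_ends("AGATCT", 1)    # BglII cuts A^GATCT
print(ends["left_top"], ends["right_top"])
print(ends["left_bottom"], ends["right_bottom"])
```

For the BglII site, the sketch yields AGATC over TCTAG on the left piece and GATCT over CTAGA on the right piece, in agreement with the ends drawn in the text.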

The polymerase and dNTP's were removed, and the cleaved plasmid was then digested with EcoRI. The
smaller fragment which contained the NPT II promoter region was removed, leaving a large fragment with
one EcoRI end and one blunt end. This large fragment was mixed with the 308 bp fragment which
contained the NOS promoter, described previously and shown on Figure 5. The fragments were ligated, and
inserted into E. coli. E. coli clones were selected for ampicillin resistance. Replacement of the NPT II
promoter region (a bacterial promoter) with the NOS promoter region (which is believed to be active only in
plant cells) caused the NPT II structural sequence to become inactive in E. coli. Plasmids from 36
kanamycin-sensitive clones were obtained; the plasmid from one clone, designated as pMON58, was
utilized in subsequent work.

The foregoing steps are represented in Figure 2 as steps 52 and 54. 

Plasmid pMON58 may be digested to obtain a 1.3 kb EcoRI-BamHI fragment which contains the NOS
promoter region, the NOS 5' non-translated region, and the NPT II structural sequence. This step is
represented in Figure 2 as step 56.



Insertion of NOS 3' sequence into NPT II gene

As mentioned above in "Background Art", the functions of 3' non-translated regions in eucaryotic genes
are not fully understood. However, they are believed to contain at least one important sequence, a
poly-adenylation signal.

It was suspected by the Applicants that a gene having a bacterial 3' non-translated region might not be
expressed as effectively in a plant cell as the same gene having a 3' non-translated region from a gene,
such as NOS, which is known to be expressed in plants. Therefore, the Applicants decided to add a NOS 3'
non-translated region to the chimeric gene, in addition to the NPT II 3' non-translated region already
present. Alternatively, it is possible, using the methods described herein, to delete the NPT II or other
existing 3' non-translated region and replace it with a desired 3' non-translated region that is known to be
expressed in plant cells. Whether a different type of 3' non-translated region (such as a 3' region from an
octopine-type or agropine-type Ti plasmid, or a 3' region from a gene that normally exists in a plant cell)
would be suitable or preferable for use in any particular type of chimeric gene, for use in any specific type
of plant cell, may be determined by those skilled in the art through routine experimentation using the
method of this invention.

Those skilled in the art may also determine through routine experimentation whether the 3'
non-translated region that naturally follows a structural sequence that is to be inserted into a plant cell will
enhance the efficient expression of that structural sequence in that type of plant cell. If so, then the steps
required to insert a different 3' non-translated region into the chimeric gene might not be required in order
to perform the method of this invention.

In order to obtain a DNA fragment containing a NOS 3' non-translated region appropriate for joining to
the NPT II structural sequence from pMON58 (described previously), the Applicants utilized a 3.4 kb
HindIII-23 fragment from a Ti plasmid, shown on Figure 3. This 3.4 kb fragment was isolated and digested with
BamHI to obtain a 1.1 kb BamHI-HindIII fragment containing a 3' portion of the NOS structural sequence
(including the stop codon), and the 3' non-translated region of the NOS gene (including the poly-adenylation
signal). This 1.1 kb fragment was inserted into a pBR327 plasmid which had been digested with HindIII and
BamHI. The resulting plasmid was designated as pMON42, as shown on Figure 8.

Plasmid pMON42 was digested with BamHI and RsaI, and a 720 bp fragment containing the desired
NOS 3' non-translated region was purified on a gel. The 720 bp fragment was digested with another
endonuclease, MboI, and treated with the large fragment of E. coli DNA polymerase I. This resulted in a 260
bp fragment with blunt MboI ends, containing a large part of the NOS 3' non-translated region including the
poly-A signal.

The foregoing procedure is represented in Figure 2 by step 58. However, it is recognized that alternate
means could have been utilized; for example, it might have been possible to digest the HindIII-23 fragment
directly with MboI to obtain the desired 260 bp fragment with the NOS 3' non-translated region.


Assembly of Chimeric Gene 

To complete the assembly of the chimeric gene, it was necessary to ligate the 260 bp MboI fragment
(which contained the NOS 3' non-translated region) to the 1.3 kb EcoRI-BamHI fragment from pMON58
(which contained the NOS promoter region and 5' non-translated region and the NPT II structural
sequence). In order to facilitate this ligation and control the orientation of the fragments, the Applicants
decided to convert the MboI ends of the 260 bp fragment into a BamHI end (at the 5' end of the fragment)
and an EcoRI end (at the 3' end of the fragment). In order to perform this step, the Applicants used the
following method.

The 260 bp MboI fragment, the termini of which had been converted to blunt ends by Klenow
polymerase, was inserted into M13 mp8 DNA at a SmaI cleavage site. The SmaI site is surrounded by a
variety of other cleavage sites present in the M13 mp8 DNA, as shown in Figure 8. The MboI fragment
could be inserted into the blunt SmaI ends in either orientation. The orientation of the MboI fragment in
different clones was tested, using HinfI cleavage sites located asymmetrically within the MboI fragment. A
clone was selected in which the 3' end of the NOS 3' non-translated region was located near the EcoRI
cleavage site in the M13 mp8 DNA. This clone was designated as the M-2 clone, as shown in Figure 8.
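
The reason an asymmetrically placed site reveals orientation can be shown with a small Python sketch. It computes the size of the diagnostic fragment that runs from a fixed site in the vector to a single site inside the insert, for both orientations. All positions used here (a site 60 bp into a 260 bp insert, and a vector site 100 bp from the insertion point) are hypothetical and chosen only for illustration; the text does not give the actual HinfI positions.

```python
def diagnostic_fragment(insert_len: int, site_pos: int,
                        flank_dist: int, forward: bool) -> int:
    """Size of the fragment between a vector site and a site inside the insert.

    `site_pos` is measured from the insert's 5' end; `flank_dist` is the
    distance from the vector site to the insertion point.
    """
    if site_pos not in range(1, insert_len):
        raise ValueError("site must lie strictly inside the insert")
    # Reversing the insert moves the site to the mirror-image position.
    offset = site_pos if forward else insert_len - site_pos
    return flank_dist + offset

# Hypothetical numbers: 260 bp insert, site 60 bp from its 5' end.
forward_size = diagnostic_fragment(260, 60, 100, forward=True)
reverse_size = diagnostic_fragment(260, 60, 100, forward=False)

# A centrally placed site would give the same size in both orientations.
central_forward = diagnostic_fragment(260, 130, 100, forward=True)
central_reverse = diagnostic_fragment(260, 130, 100, forward=False)

print(forward_size, reverse_size, central_forward, central_reverse)
```

With these assumed positions the two orientations give fragments of different sizes, which can be resolved on a gel, whereas a site at the exact center of the insert would be uninformative.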

Replicative form (double-stranded) DNA from the M-2 clone was digested by EcoRI and BamHI and a
280 bp fragment was isolated. Separately, plasmid pMON58 was digested by EcoRI and BamHI, and a
1300 bp fragment was isolated. The two fragments were ligated, as shown in Figure 9, to complete the
assembly of a NOS-NPTII-NOS chimeric gene having EcoRI ends.

There are a variety of ways to control the ligation of the two fragments. For example, the two
EcoRI-BamHI fragments could be joined together with DNA ligase and cleaved with EcoRI. After inactivation of
EcoRI, a vector molecule having EcoRI ends that were treated with calf alkaline phosphatase (CAP) may be
added to the mixture. The fragments in the mixture may be ligated in a variety of orientations. The plasmid
mixture is used to transform E. coli, and cells having plasmids with the desired orientation are selected or
screened, as described below.

A plasmid, designated as pMON38, was created by insertion of the HindIII-23 fragment (from Ti plasmid
pTiT37) into the HindIII cleavage site of the plasmid pBR327. Plasmid pMON38 contains a unique EcoRI
site, and an ampicillin-resistance gene which is expressed in E. coli. Plasmid pMON38 was cleaved with
EcoRI and treated with alkaline phosphatase to prevent it from re-ligating to itself (see U.S. Patent 4,264,731,
Shine, 1981). The resulting fragment was mixed with the 1300 bp NOS-NPTII fragment from pMON58, and
the 280 bp NOS fragment from M-2, which had been ligated and EcoRI-cleaved as described in the
previous paragraph. The fragments were ligated, and inserted into E. coli. The E. coli cells which had
acquired intact plasmids with ampicillin-resistance genes were selected on plates containing ampicillin.
Several clones were selected, and the orientation of the inserted chimeric genes was evaluated by means
of cleavage experiments. Two clones having plasmids carrying NOS-NPTII-NOS inserts with opposite
orientations were selected and designated as pMON75 and pMON76, as shown in Figure 8. The chimeric
gene may be isolated by digesting either pMON75 or pMON76 with EcoRI and purifying a 1580 bp
fragment.

The foregoing procedure is represented on Figure 2 by step 60. 

This completes the discussion of the NOS-NPTII-NOS chimeric gene. Additional information on the
creation of this gene is provided in the Examples. A copy of this chimeric gene is contained in plasmid
pMON128; it may be removed from pMON128 by digestion with EcoRI. A culture of E. coli containing
pMON128 has been deposited with the American Type Culture Collection; this culture has been assigned
accession number 39264.

To prove the utility of this chimeric gene, the Applicants inserted it into plant cells. The NPTII structural
sequence was expressed in the plant cells, causing them and their descendants to acquire resistance to
concentrations of kanamycin which are normally toxic to plant cells.



Creation of NPT I Chimeric Gene 


In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a
NOS promoter region and 5' non-translated region, (2) a structural sequence which codes for NPT I, and (3)
a NOS 3' non-translated region.

NPT I and NPT II are different and distinct enzymes with major differences in their amino acid
sequences and substrate specificities. See, e.g., E. Beck et al, 1982. The relative stabilities and activities of
these two enzymes in various types of plant cells are not yet fully understood, and NPT I may be preferable
to NPT II for use in certain types of experiments and plant transformations.

A 1200 bp fragment containing an entire NPT I gene was obtained by digesting pACYC177 (Chang and
Cohen, 1978) with the endonuclease AvaII. The AvaII termini were converted to blunt ends with Klenow
polymerase, and converted to BamHI termini using a synthetic linker. This fragment was inserted into a
unique BamHI site in a pBR327-derived plasmid, as shown in Figure 11. The resulting plasmid was
designated as pMON66.

Plasmid pMON57 (a deletion derivative of pBR327, as shown in Figure 11) was digested with AvaII. The
225 bp fragment of pMON57 was replaced by the analogous 225 bp AvaII fragment taken from plasmid
pUC8 (Vieira and Messing, 1982), to obtain a derivative of pMON57 with no PstI cleavage sites. This
plasmid was designated as pMON67.

Plasmid pMON58 (described previously and shown in Figure 7) was digested with EcoRI and BamHI to
obtain a 1300 bp fragment carrying the NOS promoter and the NPT II structural sequence. This fragment
was inserted into pMON67 which had been digested with EcoRI and BamHI. The resulting plasmid was
designated as pMON73, as shown in Figure 12.

pMON73 was digested with PstI and BamHI, and a 2.4 kb fragment was isolated containing a NOS
promoter region and 5' non-translated region. Plasmid pMON66 (shown on Figure 11) was digested with
XhoI and BamHI to yield a 950 bp fragment containing the structural sequence of NPT I. This fragment
lacked about 30 nucleotides at the 5' end of the structural sequence. A synthetic linker containing the
missing bases, having appropriate PstI and XhoI ends, was created. The pMON73 fragment, the pMON66
fragment, and the synthetic linker were ligated together to obtain plasmid pMON78, as shown in Figure 13.
This plasmid contains the NOS promoter region and 5' non-translated region joined to the NPT I structural
sequence. The ATG start codon was in the same position that the ATG start codon of the NOS structural
sequence had occupied.

Plasmid pMON78 was digested with EcoRI and BamHI to yield a 1300 bp fragment carrying the
chimeric NOS-NPT I regions. Double-stranded DNA from the M-2 clone (described previously and shown on
Figure 9) was digested with EcoRI and BamHI, to yield a 280 bp fragment carrying a NOS 3' non-translated
region with a poly-adenylation signal. The two fragments described above were ligated together to create
the NOS-NPT I-NOS chimeric gene, which was inserted into plasmid pMON38 (described above) which had
been digested with EcoRI. The two resulting plasmids, having chimeric gene inserts with opposite
orientations, were designated as pMON106 and pMON107, as shown in Figure 14.

Either of plasmids pMON106 or pMON107 may be digested with EcoRI to yield a 1.8 kb fragment
containing the chimeric NOS-NPT I-NOS gene. This fragment was inserted into plasmid pMON120 which
had been digested with EcoRI and treated with alkaline phosphatase. The resulting plasmids, having inserts
with opposite orientations, were designated as pMON130 and pMON131, as shown on Figure 15.

The NOS-NPT I-NOS chimeric gene was inserted into plant cells, which acquired resistance to
kanamycin. This demonstrates expression of the chimeric gene in plant cells.


Creation of Chimeric Gene with Soybean Promoter 

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a
promoter region and 5' non-translated region taken from a gene which naturally exists in soybean; this gene
codes for the small subunit of ribulose-1,5-bisphosphate carboxylase (sbss, for soybean small subunit); (2)
a structural sequence which codes for NPT II; and (3) a NOS 3' non-translated region.

The sbss gene codes for a protein in soybean leaves which is involved in photosynthetic carbon
fixation. The sbss protein is the most abundant protein in soybean leaves (accounting for about 10% of the
total leaf protein), so it is likely that the sbss promoter region causes prolific transcription.

There are believed to be approximately six genes encoding the sbss protein in the soybean genome.
One of the members of the sbss gene family, SRS1, which is highly transcribed in soybean leaves, has
been cloned and characterized. The promoter region, 5' non-translated region, and a portion of the structural
sequence are contained on a 2.1 kb EcoRI fragment that was subcloned into the EcoRI site of plasmid
pBR325 (Bolivar, 1978). The resultant plasmid, pSRS2.1, was a gift to Monsanto Company from Dr. R. B.
Meagher, University of Georgia, Athens, GA. The 2.1 kb EcoRI fragment from pSRS2.1 is shown on Figure
16.

Plasmid pSRS2.1 was prepared from dam- E. coli cells, and cleaved with MboI to obtain an 800 bp
fragment. This fragment was inserted into plasmid pKC7 (Rao and Rogers, 1979) which had been cleaved
with BglII. The resulting plasmid was designated as pMON121, as shown on Figure 17.

Plasmid pMON121 was digested with EcoRI and BclI, and a 1200 bp fragment containing the sbss
promoter region was isolated. Separately, plasmid pMON75 (described previously and shown on Figure 9)
was digested with EcoRI and BglII, and a 1250 bp fragment was isolated, containing a NPT II structural
sequence and a NOS 3' non-translated region. The two fragments were ligated at the compatible BclI/BglII
overhangs, to create a 2450 bp fragment containing a sbss-NPT II-NOS chimeric gene. This fragment was
inserted into pMON120 which had been cleaved with EcoRI, to create two plasmids having chimeric gene
inserts with opposite orientations, as shown in Figure 18. The plasmids were designated as pMON141 and
pMON142.
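
The compatibility of the BclI and BglII ends can be illustrated with a short Python sketch. It uses the standard recognition sequences and cut positions of the enzymes (BglII A^GATCT, BclI T^GATCA, BamHI G^GATCC, MboI ^GATC, EcoRI G^AATTC); the table layout and the function names are hypothetical. The sketch also adds the two fragment sizes quoted in the text.

```python
# Recognition sequence and number of bases before the cut on the top strand.
ENZYMES = {
    "BglII": ("AGATCT", 1),
    "BclI": ("TGATCA", 1),
    "BamHI": ("GGATCC", 1),
    "MboI": ("GATC", 0),
    "EcoRI": ("GAATTC", 1),
}

def overhang(name: str) -> str:
    """Single-stranded 5' overhang left by a palindromic cutter."""
    site, cut = ENZYMES[name]
    return site[cut:len(site) - cut]

def compatible(a: str, b: str) -> bool:
    """Two ends can anneal and be ligated if their overhangs are identical."""
    return overhang(a) == overhang(b)

def hybrid_site(left: str, right: str) -> str:
    """Sequence formed when a `left` end is ligated to a `right` end."""
    left_site, left_cut = ENZYMES[left]
    right_site, right_cut = ENZYMES[right]
    return left_site[:left_cut] + right_site[right_cut:]

junction = hybrid_site("BclI", "BglII")
chimeric_bp = 1200 + 1250    # fragment sizes quoted in the text

print(compatible("BclI", "BglII"), compatible("BglII", "EcoRI"))
print(junction, chimeric_bp)
```

In this model the BclI/BglII junction reads TGATCT, which matches neither parental recognition sequence, so the joined site would not be recut by BclI or BglII. The same GATC overhang explains how the MboI fragment described above could be inserted into BglII-cleaved pKC7.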

The sbss-NPTII-NOS chimeric genes were inserted into several types of plant cells, causing the plant
cells to acquire resistance to kanamycin.

This successful transformation proved that a promoter region from one type of plant can cause the
expression of a gene within plant cells from an entirely different genus, family, and order of plants.

The chimeric sbss-NPT II-NOS gene also had another significant feature. Sequencing experiments
indicated that the 800 bp MboI fragment contained the ATG start codon of the sbss structural sequence.
Rather than remove this start codon, the Applicants decided to insert a stop codon behind it in the same
reading frame. This created a dicistronic mRNA sequence, which coded for a truncated amino portion of the
sbss polypeptide and a complete NPT II polypeptide. Expression of the NPT II polypeptide was the first
proof that a dicistronic mRNA can be translated within plant cells.
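
The arrangement of two reading frames on one message can be pictured with a toy Python sketch. It scans a sequence for an ATG, reads codons until an in-frame stop codon, and then resumes the search downstream. The 26-base sequence is invented purely for illustration and is unrelated to the sbss or NPT II sequences; the sketch locates reading frames only and makes no claim about how ribosomes reinitiate in plant cells.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_cistrons(seq: str) -> list:
    """Return (start, end) index pairs of successive ATG...stop reading frames."""
    cistrons = []
    search_from = 0
    while True:
        start = seq.find("ATG", search_from)
        if start == -1:
            break
        # Read codon by codon from the ATG until an in-frame stop codon.
        for pos in range(start, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                cistrons.append((start, pos + 3))
                search_from = pos + 3
                break
        else:
            break    # no in-frame stop codon: the frame is incomplete
    return cistrons

# Invented message: a short first frame, a spacer, then a second frame.
message = "GG" + "ATGGCTTAA" + "CC" + "ATGAAACCCTGA" + "G"
frames = find_cistrons(message)
print(frames)
print([message[a:b] for a, b in frames])
```

Here the first frame stands in for the truncated sbss coding region closed by the inserted stop codon, and the second frame stands in for the complete NPT II coding region that follows it on the same transcript.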

The sbss promoter is contained in plasmid pMON154, described below. A culture of E. coli containing
this plasmid has been deposited with the American Type Culture Collection. This culture has been assigned
accession number 39265.



Creation of BGH Chimeric Genes

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising (1) a
sbss promoter region and 5' non-translated region, (2) a structural sequence which codes for bovine growth
hormone (BGH), and (3) a NOS 3' non-translated region. This chimeric gene was created as follows.

A structural sequence which codes for the polypeptide bovine growth hormone (see, e.g., Woychik et
al, 1982) was inserted into a pBR322-derived plasmid. The resulting plasmid was designated as plasmid
CF-1. This plasmid was digested with EcoRI and HindIII to yield a 570 bp fragment containing the structural
sequence. Double-stranded M-2 RF DNA (described previously and shown in Figure 8) was cleaved with
EcoRI and HindIII to yield a 290 bp fragment which contained the NOS 3' non-translated region with a
poly-adenylation signal. The two fragments were ligated together and digested with EcoRI to create an 860 base
pair fragment with EcoRI ends, which contained a BGH-coding structural sequence joined to the NOS 3'
non-translated region. This fragment was introduced into plasmid pMON38, which had been digested with
EcoRI and treated with alkaline phosphatase, to create a new plasmid, designated as pMON108, as shown
in Figure 19.

A unique BglII restriction site was introduced at the 5' end of the BGH structural sequence by digesting
pMON108 with EcoRI to obtain the 860 bp fragment and using Klenow polymerase to create blunt ends on
the resulting EcoRI fragment. This fragment was ligated into plasmid N25 (a derivative of pBR327
containing a synthetic linker carrying BglII and XbaI cleavage sites inserted at the BamHI site), which had
been cleaved with XbaI and treated with Klenow polymerase to obtain blunt ends (N25 contains a unique
BglII site located 12 bases from the XbaI site). The resulting plasmid, which contained the 860 bp BGH-NOS
fragment in the orientation shown in Figure 20, was designated as plasmid N25-BGH. This plasmid contains
a unique BglII cleavage site located about 25 bases from the 5' end of the BGH structural sequence.

Plasmid N25-BGH prepared from dam- E. coli cells was digested with BglII and ClaI to yield an 860 bp
fragment which contained the BGH structural sequence joined to the NOS 3' non-translated region.
Separately, plasmid pMON121 (described previously and shown in Figure 17) was prepared from dam- E.
coli cells and was digested with ClaI and BclI to create an 1100 bp fragment which contained the sbss
promoter region. The fragments were ligated at their compatible BclI/BglII overhangs, and digested with ClaI
to yield a ClaI fragment of about 2 kb containing the chimeric sbss-BGH-NOS gene. This fragment was
inserted into pMON120 (described previously and shown in Figure 10) which had been digested with ClaI.
The resulting plasmids, containing the inserted chimeric gene in opposite orientations, were designated
pMON147 and pMON148, as shown in Figure 21.

An alternate chimeric BGH gene was created which contained (1) a NOS promoter region and 5'
non-translated region, (2) a structural sequence which codes for BGH, and (3) a NOS 3' non-translated region,
by the following method, shown in Figure 22.

Plasmid pMON76 (described above and shown in Figure 9) was digested with EcoRI and BglII to obtain
a 308 bp fragment containing a NOS promoter region and 5' non-translated region. Plasmid N25-BGH
prepared from dam- E. coli cells (described above and shown in Figure 20) was digested with BglII and ClaI
to obtain a 900 bp fragment containing a BGH structural sequence and a NOS 3' non-translated region.
These two fragments were ligated together to obtain a chimeric NOS-BGH-NOS gene in a fragment with
EcoRI and ClaI ends. This fragment was ligated with an 8 kb fragment obtained by digesting pMON120 with
EcoRI and ClaI. The resulting plasmid, designated as pMON149, is shown in Figure 22.



Creation of Chimeric NOS-EPSP-NOS Gene

In an alternate preferred embodiment, a chimeric gene was created comprising (1) a NOS promoter
region and 5' non-translated region, (2) a structural sequence which codes for the E. coli enzyme 5-enol
pyruvyl shikimate-3-phosphoric acid synthase (EPSP synthase), and (3) a NOS 3' non-translated region.

EPSP synthase is believed to be the target enzyme for the herbicide glyphosate, which is marketed by
Monsanto Company under the registered trademark "Roundup." Glyphosate is known to inhibit EPSP
synthase activity (Amrhein et al, 1980), and amplification of the EPSP synthase gene in bacteria is known to
increase their resistance to glyphosate. Therefore, increasing the level of EPSP synthase activity in plants
may confer resistance to glyphosate in transformed plants. Since glyphosate is toxic to most plants, this
provides for a useful method of weed control. Seeds of a desired crop plant which has been transformed to
increase EPSP synthase activity may be planted in a field. Glyphosate may be applied to the field at
concentrations which will kill all non-transformed plants, leaving the transformed plants unharmed.

An EPSP synthase gene may be isolated by a variety of means, including the following. A lambda
phage library may be created which carries a variety of DNA fragments produced by HindIII cleavage of E.
coli DNA. See, e.g., Maniatis et al, 1982.

The EPSP synthase gene is one of the genes which are involved in the production of aromatic amino
acids. These genes are designated as the "aro" genes; EPSP synthase is designated as aroA. Cells which
do not contain functional aro genes are designated as aro- cells. Aro- cells must normally be grown on
media supplemented by aromatic amino acids. See Pittard and Wallace, 1966.

Different lambda phages which carry various HindIII fragments may be used to infect mutant E. coli
cells which do not have EPSP synthase genes. The infected aro- cells may be cultured on media which
do not contain the aromatic amino acids, and transformed aro+ clones which are capable of growing on
such media may be selected. Such clones are likely to contain the EPSP synthase gene. Phage particles
may be isolated from such clones, and DNA may be isolated from these phages. The phage DNA may be
cleaved with one or more restriction endonucleases, and by a gradual process of analysis, a fragment which
contains the EPSP synthase gene may be isolated.

Using a procedure similar to the method summarized above, the Applicants isolated an 11 kb HindIII
fragment which contained the entire E. coli EPSP synthase gene. This fragment was digested with BglII to
produce a 3.5 kb HindIII-BglII fragment which contained the entire EPSP synthase gene. This 3.5 kb
fragment was inserted into plasmid pKC7 (Rao and Rogers, 1979) to produce plasmid pMON4, which is
shown in Figure 23.

Plasmid pMON4 was digested with ClaI to yield a 2.5 kb fragment which contained the EPSP synthase
structural sequence. This fragment was inserted into pBR327 that had been digested with ClaI, to create
pMON8, as shown in Figure 23.

pMON8 was digested with BamHI and NdeI to obtain a 4.9 kb fragment. This fragment lacked about
200 nucleotides encoding the amino terminus of the EPSP synthase structural sequence.

The missing nucleotides were replaced by ligating a HinfI/NdeI fragment, obtained from pMON8 as
shown in Figure 24, together with a synthetic oligonucleotide sequence containing (1) the EPSP synthase
start codon and the first three nucleotides, (2) a unique BglII site, and (3) the appropriate BamHI and HinfI
ends. The resulting plasmid, pMON25, contains an intact EPSP synthase structural sequence with unique
BamHI and BglII sites positioned near the start codon.

Double-stranded M-2 DNA (described previously and shown in Figure 8) was digested with HindIII and
EcoRI to yield a 290 bp fragment which contains the NOS 3' non-translated region and poly-adenylation
signal. This fragment was introduced into a pMON25 plasmid that had been digested with EcoRI and HindIII
to create a plasmid, designated as pMON146 (shown in Figure 25), which contains the EPSP structural
sequence joined to the NOS 3' non-translated region.

pMON146 was cleaved with ClaI and BglII to yield a 2.3 kb fragment carrying the EPSP structural
sequence joined to the NOS 3' non-translated region. pMON76 (described previously and shown in Figure
9) was digested with BglII and EcoRI to create a 310 bp fragment containing the NOS promoter region and
5' non-translated region. The above fragments were mixed with pMON120 (described previously and shown
in Figure 10) that had been digested with ClaI and EcoRI, and the mixture was ligated. The resulting
plasmid, designated pMON153, is shown in Figure 26. This plasmid contains the chimeric NOS-EPSP-NOS
gene.

A plasmid containing a chimeric sbss-EPSP-NOS gene was prepared in the following manner, shown in
Figure 27. Plasmid pMON146 (described previously and shown in Figure 25) was digested with ClaI and
BglII, and a 2.3 kb fragment was purified. This fragment contained the EPSP synthase structural sequence
coupled to a NOS 3' non-translated region with a poly-adenylation signal. Plasmid pMON121 (described
above and shown in Figure 17) was digested with ClaI and BclI, and a 1.1 kb fragment was purified. This
fragment contains an sbss promoter region and 5' non-translated region. The two fragments were mixed
and ligated with T4 DNA ligase and subsequently digested with ClaI. This created a chimeric
sbss-EPSP-NOS gene, joined through compatible BglII and BclI termini. This chimeric gene with ClaI
termini was inserted into plasmid pMON120 which had been digested with ClaI and treated with calf
alkaline phosphatase (CAP). The mixture was ligated with T4 DNA ligase. The resulting mixture of
fragments and plasmids was used to transform E. coli cells, which were selected for resistance to
spectinomycin. A colony of resistant cells was isolated, and the plasmid in this colony was designated as
pMON154, as shown in Figure 27.

A culture of E. coli containing pMON154 has been deposited with the American Type Culture Collection.
This culture has been assigned accession number 39265.



Creation of CaMV(19S)-NPT II-NOS Genes

In an alternate preferred embodiment of this invention, a chimeric gene was created which contained
the following elements:

1. a promoter region and a 5' non-translated region derived from the CaMV (19S) gene, which codes for
the P66 protein;

2. a partial coding sequence from the CaMV (19S) gene, including an ATG start codon and several
internal ATG sequences, all of which were in the same frame as a TGA termination sequence immediately
inside the desired ATG start codon of the NPT II gene;

3. a structural sequence derived from a neomycin phosphotransferase II (NPT II) gene; this sequence
was preceded by a spurious ATG sequence, which was in the same reading frame as a TGA sequence
within the NPT II structural sequence; and,

4. a 3' non-translated region, including a poly-adenylation signal, derived from a nopaline synthase
(NOS) gene.

This chimeric gene, referred to herein as the CaMV(19S)-NPT II-NOS gene, was inserted into plasmid
pMON120 to create a plasmid designated as pMON156, shown in Figure 29 and described in Example 9.
Plasmid pMON156 was inserted in A. tumefaciens cells, where it formed a co-integrate Ti plasmid by
means of a single crossover event with a Ti plasmid in the A. tumefaciens cell. The chimeric gene in the
co-integrate plasmid was within a modified T-DNA region in the Ti plasmid, surrounded by left and right
T-DNA borders.

A similar chimeric gene was created and assembled in a plasmid designated as pMON155, shown in
Figure 32 and described in Example 10. This chimeric gene resembled the gene in pMON156, with two
exceptions:

1. an oligonucleotide linker having stop codons in all three reading frames was inserted between the
CaMV(19S) partial structural sequence and the NPT II structural sequence; and,

2. the spurious ATG sequence on the 5' side of the NPT II structural sequence was deleted.

The construction of this chimeric gene is described in Example 10. This gene was inserted into A.
tumefaciens cells and subsequently into plant cells. Its level of expression was apparently higher than the
expression of the similar gene in pMON156, as assayed by growth on higher concentrations of kanamycin.
A. tumefaciens cells containing co-integrate Ti::pMON156 plasmids have been deposited with the American
Type Culture Collection, and have been assigned ATCC accession number 39338.
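
The requirement that a linker carry stop codons in all three reading frames can be checked
mechanically. The Python sketch below is illustrative only: the sequence EXAMPLE_LINKER is a
hypothetical sequence chosen for demonstration and is not the linker used in Example 10, and the
function name stops_by_frame is likewise an assumption.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Hypothetical 11-base sequence chosen for illustration; NOT the linker of Example 10.
EXAMPLE_LINKER = "TAACTAGCTGA"


def stops_by_frame(seq):
    """Map each forward reading frame (0, 1, 2) to the positions of its stop codons."""
    seq = seq.upper()
    result = {}
    for frame in range(3):
        positions = []
        # Step through complete codons only, starting at the frame offset.
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in STOP_CODONS:
                positions.append(i)
        result[frame] = positions
    return result


def has_stop_in_all_frames(seq):
    """True if translation entering in any of the three frames would terminate."""
    return all(stops_by_frame(seq).values())


if __name__ == "__main__":
    print(stops_by_frame(EXAMPLE_LINKER))
    print(has_stop_in_all_frames(EXAMPLE_LINKER))
```

A sequence of this kind would be expected to terminate translation that begins at any upstream
ATG, whichever frame that ATG happens to set.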


Creation of Chimeric CaMV(32S)-NPT II-NOS Genes

In an alternate preferred embodiment of this invention, a chimeric gene was created comprising

(1) a promoter region which causes transcription of the 32S CaMV mRNA;

(2) a structural sequence which codes for NPT II; and

(3) a NOS 3' non-translated region.

The assembly of this chimeric gene is described in Example 11 and Figures 33 through 37. This gene
was inserted into plant cells and it caused them to become resistant to kanamycin.

Petunia plants cannot normally be infected by CaMV. Those skilled in the art may determine through
routine experimentation whether any particular plant viral promoter (such as the CaMV promoter) will
function at satisfactory levels in any particular type of plant cell, including plant cells that are outside of the
normal host range of the virus from which the promoter was derived.


Means for Inserting Chimeric Genes Into Plant Cells 

A variety of methods are known for inserting foreign DNA into plant cells. One such method, utilized by
the Applicants, involved inserting a chimeric gene into Ti plasmids carried by A. tumefaciens, and
co-cultivating the A. tumefaciens cells with plants. A segment of T-DNA carrying the chimeric gene was
transferred into the plant genome, causing transformation. This method is described in detail in two
separate U.S. patent applications entitled "Plasmids for Transforming Plant Cells," serial number 458,411
(WO84/02919), and "Genetically Transformed Plants," serial number 458,402 (WO84/02920), both of which
were filed on January 17, 1983.

A variety of other methods are listed below. These methods are theoretically capable of inserting the
chimeric genes of this invention into plant cells, although the reported transformation efficiencies achieved
to date by such methods have been low. The chimeric genes of this invention (especially those chimeric
genes such as NPT I and NPT II, which may be utilized as selectable markers) are likely to facilitate
research on methods of inserting DNA into plants or plant cells.

1. One alternate technique for inserting DNA into plant cells involves the use of lipid vesicles, also
called liposomes. Liposomes may be utilized to encapsulate one or more DNA molecules. The liposomes
and their DNA contents may be taken up by plant cells; see, e.g., Lurquin, 1981. If the inserted DNA can be
incorporated into the plant genome, replicated, and inherited, the plant cells will be transformed.

To date, efforts to use liposomes to deliver DNA into plant cells have not met with great success
(Fraley and Papahadjopoulos, 1981). Only relatively small DNA molecules have been transferred into plant
cells by means of liposomes, and none have yet been expressed. However, liposome-delivery technology is
still being actively developed, and it is likely that methods will be developed for transferring plasmids
containing the chimeric genes of this invention into plant cells by means involving liposomes.

2. Other alternate techniques involve contacting plant cells with DNA which is complexed with either (a)
polycationic substances, such as poly-L-ornithine (Davey et al, 1980), or (b) calcium phosphate (Krens et al,
1982). Although efficiencies of transformation achieved to date have been low, these methods are still being
actively researched.

3. A method has been developed involving the fusion of bacteria, which contain desired plasmids, with
plant cells. Such methods involve converting the bacteria into spheroplasts and converting the plant cells
into protoplasts. Both of these methods remove the cell wall barrier from the bacterial and plant cells, using
enzymic digestion. The two cell types can then be fused together by exposure to chemical agents, such as
polyethylene glycol. See Hasezawa et al, 1981. Although the transformation efficiencies achieved to date by
this method have been low, similar experiments using fusions of bacterial and animal cells have produced
good results; see Rassoulzadegan et al, 1982.

4. Two other methods which have been used successfully to genetically transform animal cells involve
(a) direct microinjection of DNA into animal cells, using very small glass needles (Capecchi, 1980), and (b)
electric-current-induced uptake of DNA by animal cells (Wong and Neumann, 1982). Although neither of
these techniques has been utilized to date to transform plant cells, they may be useful to insert chimeric
genes of this invention into plant cells.



Meaning of Various Phrases 


A variety of phrases which are used in the claims must be defined and described to clarify the meaning 
and coverage of the claims. 

The meaning of any particular term shall be interpreted with reference to the text and figures of this
application. In particular, it is recognized that a variety of terms have developed which are used
inconsistently in the literature. For example, a variety of meanings have evolved for the term "promoter,"
some of which include the 5' non-translated region and some of which do not. In an effort to avoid problems
of interpretation, the Applicants have attempted to define various terms. However, such definitions are not
presumed or intended to be comprehensive and they shall be interpreted in light of the relevant literature.

The term "chimeric gene" refers to a gene that contains at least two portions that were derived from
different and distinct genes. As used herein, this term is limited to genes which have been assembled,
synthesized, or otherwise produced as a result of man-made efforts, and any genes which are replicated or
otherwise derived therefrom. "Man-made efforts" include enzymatic, cellular, and other biological
processes, if such processes occur under conditions which are caused, enhanced, or controlled by human
effort or intervention; this excludes genes which are created solely by natural processes.

As used herein, a "gene" is limited to a segment of DNA which is normally regarded as a gene by
those skilled in the art. For example, a plasmid might contain a plant-derived promoter region and a
heterologous structural sequence, but unless those two segments are positioned with respect to each other
in the plasmid such that the promoter region causes the transcription of the structural sequence, then those
two segments would not be regarded as included in the same gene.

This invention relates to chimeric genes which have structural sequences that are "heterologous" with
respect to their promoter regions. This includes at least two types of chimeric genes:

1. DNA of a gene which is foreign to a plant cell. For example, if a structural sequence which codes for
mammalian protein or bacterial protein is coupled to a plant promoter region, such a gene would be
regarded as heterologous.

2. A plant cell gene which is naturally promoted by a different plant promoter region. For example, if a
structural sequence which codes for a plant protein is normally controlled by a low-quantity promoter, the
structural sequence may be coupled with a prolific promoter. This might cause a higher quantity of
transcription of the structural sequence, thereby leading to plants with higher protein content. Such a
structural sequence would be regarded as heterologous with regard to the prolific promoter.

However, it is not essential for this invention that the entire structural sequence be heterologous with
respect to the entire promoter region. For example, a chimeric gene of this invention may be created which
would be translated into a "fusion protein", i.e., a protein comprising polypeptide portions derived from two
separate structural sequences. This may be accomplished by inserting all or part of a heterologous
structural sequence into the structural sequence of a plant gene, somewhere after the start codon of the
plant structural sequence.

As used herein, the phrase, "a promoter region derived from a specified gene" shall include a promoter
region if one or more parts of the promoter region were derived from the specified gene. For example, it
might be discovered that one or more portions of a particular plant-derived promoter region (such as
intervening region 8, shown on Figure 1) might be replaced by one or more sequences derived from a
different gene, such as the gene that contains the heterologous structural sequence, without reducing the
expression of the resulting chimeric gene in a particular type of host cell. Such a chimeric gene would
contain a plant-derived association region 2, intervening region 4, and transcription initiation sequence 6,
followed by heterologous intervening region 8, 5' non-translated region 10 and structural sequence 14. Such
a chimeric gene is within the scope of this invention.

As used herein, the phrase "derived from" shall be construed broadly. For example, a structural
sequence may be "derived from" a particular gene by a variety of processes, including the following:

1. the gene may be reproduced by various means such as inserting it into a plasmid and replicating the
plasmid by cell culturing, in vitro replication, or other methods, and the desired sequence may be obtained
from the DNA copies by various means such as endonuclease digestion;

2. mRNA which was coded for by the gene may be obtained and processed in various ways, such as
preparing complementary DNA from the mRNA and then digesting the cDNA with endonucleases;

3. the sequence of bases in the structural sequence may be determined by various methods, such as
endonuclease mapping or the Maxam-Gilbert method. A strand of DNA which duplicates or approximates
the desired sequence may be created by various methods, such as chemical synthesis or ligation of
oligonucleotide fragments;

4. a structural sequence of bases may be deduced by applying the genetic code to the sequence of
amino acid residues in a polypeptide. Usually, a variety of DNA structural sequences may be determined
for any polypeptide, because of the redundancy of the genetic code. From this variety, a desired sequence
of bases may be selected, and a strand of DNA having the selected sequence may be created.
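
The redundancy mentioned in item 4 can be quantified: the number of distinct DNA sequences that
encode a given polypeptide is the product of the number of codons available for each residue. The
Python sketch below uses the standard genetic code to illustrate this; the function names and the
example peptides are illustrative assumptions and do not appear in this specification.

```python
from itertools import product
from math import prod

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Invert the table: amino acid (one-letter code) -> list of codons encoding it.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def count_encodings(peptide):
    """Number of distinct DNA sequences encoding the peptide (one-letter codes)."""
    return prod(len(CODONS_FOR[aa]) for aa in peptide.upper())


def translate(dna):
    """Translate a DNA coding sequence, reading complete codons from position 0."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


if __name__ == "__main__":
    for peptide in ("MW", "ML", "MLSR"):
        print(peptide, count_encodings(peptide))
```

Methionine and tryptophan each have a single codon, while leucine, serine and arginine each have
six, so even a short peptide may be encoded by a large number of different DNA sequences.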

If desired, any DNA sequence may be modified by substituting certain bases for the existing bases.
Such modifications may be performed for a variety of reasons. For example, one or more bases in a
sequence may be replaced by other bases in order to create or delete a cleavage site for a particular
endonuclease. As another example, one or more bases in a sequence may be replaced in order to reduce
the occurrence of "stem and loop" structures in messenger RNA. Such modified sequences are within the
scope of this invention.

A structural sequence may contain introns and exons; such a structural sequence may be derived from
DNA, or from an mRNA primary transcript. Alternately, a structural sequence may be derived from
processed mRNA, from which one or more introns have been deleted.

The Applicants have deposited two cultures of E. coli cells containing plasmids pMON128 and
pMON154 with the American Type Culture Collection (ATCC). These cells have been assigned ATCC
accession numbers 39264 and 39265, respectively.

Those skilled in the art will recognize, or be able to ascertain using no more than routine
experimentation, numerous equivalents to the specific embodiments described herein. Such equivalents
are within the scope of this invention.



EXAMPLES 


Example 1: Creation of pMON1001

Fifty micrograms (ug) of lambda phage bbkan-1 DNA (Berg et al, 1975) were digested with 100 units of
HindIII (all restriction endonucleases were obtained from New England Biolabs, Beverly, MA, and were used
with buffers according to the supplier's instructions, unless otherwise specified) for 2 hr at 37°C. After
heat-inactivation (70°C, 10 min), the 3.3 kb Tn5 HindIII fragment was purified on a sucrose gradient. One
ug of the purified HindIII fragment was digested with BamHI (2 units, 1 hr, 37°C), to create a 1.8 kb fragment.






EP0 131 623 B1 



The endonuclease was heat inactivated.

Plasmid pBR327 (Soberon et al, 1981), 1 ug, was digested with HindIII and BamHI (2 units each, 2 hr,
37°C). Following digestion, the endonucleases were heat inactivated and the cleaved pBR327 DNA was
added to the BamHI-HindIII Tn5 fragments. After addition of ATP to a concentration of 0.75 mM, 10 units of
T4 DNA ligase (prepared by the method of Murray et al, 1979) were added, and the reaction was allowed to
continue for 16 hours at 12-14°C. One unit of T4 DNA ligase will give 90% circularization of one ug of
HindIII-cleaved pBR327 plasmid in 5 minutes at 22°C.

The ligated DNA was used to transform CaCl2-shocked E. coli C600 rec A56 cells (Maniatis et al,
1982). After expression in Luria broth (LB) for 1 hour at 37°C, the cells were spread on solid LB media
plates containing 200 ug/ml ampicillin and 40 ug/ml kanamycin. Following 16 hour incubation at 37°C,
several hundred colonies appeared. Plasmid mini-prep DNA was prepared from six of these (Ish-Horowicz
and Burke, 1981). Endonuclease digestion showed that all six of the plasmids carried the 1.8 kb
HindIII-BamHI fragment. One of those isolates was designated as pMON1001, as shown in Figure 6.


Example 2: Creation of pMON40 

Five ug of plasmid pMON1001 (described in Example 1) were digested with SmaI. The reaction was
terminated by phenol extraction, and the DNA was precipitated by ethanol. A BamHI linker CCGGATCCGG
(0.1 ug), which had been phosphorylated with ATP and T4 polynucleotide kinase (Bethesda Research
Laboratory, Rockville, MD), was added to 1 ug of the pMON1001 fragment. The mixture was treated with T4
DNA ligase (100 units) for 18 hours at 14°C. After heating at 70°C for 10 min to inactivate the DNA ligase,
the DNA mixture was digested with BamHI endonuclease (20 units, 3 hours, 37°C) and separated by
electrophoresis on a 0.5% agarose gel. The band corresponding to the 4.2 kb SmaI-BamHI vector
fragment was excised from the gel. The 4.2 kb fragment was purified by adsorption on glass beads
(Vogelstein and Gillespie, 1979), ethanol precipitated and resuspended in 20 ul of DNA ligase buffer with
ATP. T4 DNA ligase (20 units) was added and the mixture was incubated for 1.5 hours at room temperature.
The DNA was mixed with rubidium chloride-shocked E. coli C600 cells for DNA transformation (Maniatis et
al, 1982). After expression for 1 hour at 37°C in LB, the cells were spread on LB plates containing 200
ug/ml of ampicillin and 20 ug/ml kanamycin. The plates were incubated at 37°C for 18 hours. Twelve
ampicillin-resistant, kanamycin-resistant colonies were chosen. 2 ml cultures were grown, and mini-plasmid
preparations were performed. Endonuclease mapping of the plasmids revealed that ten of the twelve
contained no SmaI site and a single BamHI site, and were of the appropriate size, 4.2 kb. The plasmid from
one of the ten colonies was designated as pMON40, as shown in Figure 6.
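
The BamHI linker used above is self-complementary, so that a single synthetic strand can anneal
with itself to form a blunt-ended duplex that carries a BamHI site. The Python sketch below checks
this property for the linker sequence given in this Example. The BamHI recognition sequence
(GGATCC) is the commonly published one; the function and variable names are illustrative
assumptions.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

LINKER = "CCGGATCCGG"    # BamHI linker sequence given in Example 2
BAMHI_SITE = "GGATCC"    # commonly published BamHI recognition sequence (G^GATCC)


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_self_complementary(seq):
    """True if the sequence equals its own reverse complement."""
    return seq.upper() == reverse_complement(seq)


def site_positions(seq, site):
    """Return every 0-based position at which the site occurs in the sequence."""
    return [i for i in range(len(seq) - len(site) + 1) if seq[i:i + len(site)] == site]


if __name__ == "__main__":
    print(is_self_complementary(LINKER))
    print(site_positions(LINKER, BAMHI_SITE))
    # Linkers ligated end to end still present one BamHI site per copy.
    print(site_positions(LINKER * 2, BAMHI_SITE))
```

Because each ligated copy of the linker retains a BamHI site, digestion with BamHI after the
ligation step would be expected to trim multiple linkers down to a single cohesive end.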


Example 3: Creation of NOS Promoter Fragment 

An oligonucleotide with the following sequence, 5'-TGCAGATTATTTGG-3', was synthesized (Beaucage
and Caruthers, 1981, as modified by Adams et al, 1982). This oligonucleotide contained a 32P radioactive
label, which was added to the 5' thymidine residue by polynucleotide kinase.

An M13 mp7 derivative, designated as SIA, was given to Applicants by M. Bevan and M.-D. Chilton,
Washington University, St. Louis, MO. To the best of Applicants' knowledge and belief, the SIA DNA was
obtained by the following method. A pTiT37 plasmid was digested with HindIII, and a 3.4 kb fragment was
isolated and designated as the HindIII-23 fragment. This fragment was digested with Sau3a, to create a 344
bp fragment with Sau3a ends. This fragment was inserted into double-stranded, replicative form DNA from
the M13 mp7 phage vector (Messing et al, 1981) which had been cut with BamHI. Two recombinant phages
with 344 bp inserts resulted, one of which contained the anti-sense strand of the NOS promoter fragment.
That recombinant phage was designated as SIA, and a clonal copy was given to the Applicants.

The Applicants prepared the single-stranded form of the SIA DNA (14.4 ug; 6 pmol), and annealed it (10
minutes at 70°C, then cooled to room temperature) with 20 pmol of the 14-mer oligonucleotide mentioned
above. The oligonucleotide annealed to the Sau3a insert at bases 286-300 as shown on Figures 4 and 5.
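
The stated equivalence of 14.4 ug and about 6 pmol of single-stranded template can be checked by
simple arithmetic. The Python sketch below is an approximation under stated assumptions: an
average mass of 330 g/mol per nucleotide of single-stranded DNA, and a length of 7238 nucleotides
for the M13 mp7 vector. Neither figure is given in this specification; only the 344-base insert
length, the 14.4 ug mass and the 20 pmol of primer are taken from the text.

```python
AVG_NT_MASS = 330.0   # g/mol per nucleotide of ssDNA (assumed average value)
M13MP7_NT = 7238      # assumed length of the M13 mp7 vector, in nucleotides
INSERT_NT = 344       # Sau3a insert length stated in Example 3


def ssdna_pmol(mass_ug, length_nt, nt_mass=AVG_NT_MASS):
    """Convert a mass of single-stranded DNA (micrograms) to picomoles of molecules."""
    molar_mass = length_nt * nt_mass        # g/mol for the whole strand
    moles = (mass_ug * 1e-6) / molar_mass   # micrograms -> grams -> moles
    return moles * 1e12                     # moles -> picomoles


template_pmol = ssdna_pmol(14.4, M13MP7_NT + INSERT_NT)
primer_to_template = 20.0 / template_pmol   # 20 pmol of the 14-mer was used

if __name__ == "__main__":
    print(f"template: {template_pmol:.2f} pmol")
    print(f"primer:template molar ratio: {primer_to_template:.1f}")
```

Under these assumptions the template amounts to slightly less than 6 pmol, which agrees with the
rounded figure given above and corresponds to a molar excess of primer over template of roughly
three to four fold.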

200 ul of the SIA template and annealed oligonucleotide were mixed with the four dNTPs (present at a
final concentration of 1 mM, 25 ul) and 50 ul of Klenow polymerase. The mixture was incubated for 30
minutes at room temperature. During this period, the polymerase added dNTPs to the 3' end of the
oligonucleotide. The polymerase was heat-inactivated (70°C, 3 min), and HaeIII (160 units) was added. The
mixture was incubated (1 hour, 55°C), the HaeIII was inactivated (70°C, 3 min), and the four dNTPs (1 mM,
12 ul) and T4 DNA polymerase (50 units) were added. The mixture was incubated (1 hour, 37°C) and the
polymerase was inactivated (70°C, 3 min). This yielded a fragment of about 570 bp. EcoRI (150 units) was
added, the mixture was incubated (1 hour, 37°C) and the EcoRI was inactivated (70°C, 3 min).

Aliquots of the mixture were separated on 6% polyacrylamide with 25% glycerol. Autoradiography
revealed a radioactively labelled band about 310 bp in size. This band was excised. The foregoing
procedure is indicated by Figure 5.



Example 4: Creation of pMON58 

Five ug of plasmid pMON40 (described in Example 2) were digested with BglII (10 units, 1.5 hour,
37°C), and the BglII was inactivated (70°C, 10 min). The four dNTPs (1 mM, 5 ul) and Klenow polymerase
(8 units) were added, the mixture was incubated (37°C, 40 min), and the polymerase was inactivated
(70°C, 10 min). EcoRI (10 units) was added and incubated (1 hour, 37°C), and calf alkaline phosphatase
(CAP) was added and incubated (1 hour, 37°C). A fragment of about 3.9 kb was purified on agarose gel
using NA-45 membrane (Schleicher and Schuell, Keene, NH). The fragment (1.0 pM) was mixed with the
NOS promoter fragment (0.1 pM), described in Example 3, and with T4 DNA ligase (100 units). The mixture
was incubated (4°C, 16 hr). The resulting plasmids were inserted into E. coli cells, which were selected on
media containing 200 ug/ml ampicillin. Thirty-six clonal AmpR colonies were selected, and mini-preps of
plasmids were made from those colonies. The plasmid from one colony demonstrated a 308 bp EcoRI-BglII
fragment, a new SstII cleavage site carried by the 308 bp NOS fragment, and a new PstI site. This plasmid
was designated as pMON58, as shown in Figure 7. pMON58 DNA was prepared as described above.



Example 5: Creation of pMON42


Plasmid pBR325-HindIII-23, a derivative of plasmid pBR325 (Bolivar, 1978) carrying the HindIII-23
fragment of pTiT37 (see Figure 3) in the HindIII site, was given to Applicants by M. Bevan and M.-D.
Chilton, Washington University, St. Louis, MO. DNA of this plasmid was prepared and 30 ug were digested
with HindIII (50 units) and BamHI (50 units). The 1.1 kb HindIII-BamHI fragment was purified by adsorption
on glass beads (Vogelstein and Gillespie, 1979) after agarose gel electrophoresis. The purified fragment
(0.5 ug) was added to 0.5 ug of the 2.9 kb HindIII-BamHI fragment of pBR327. After treatment with DNA
ligase (20 units, 4 hours, 22°C), the resulting plasmids were introduced to E. coli C600 cells. Clones
resistant to ampicillin at 200 ug/ml were selected on solid media; 220 clones were obtained. Minipreps of
plasmid DNA were made from six of these clones and tested for the presence of a 1.1 kb fragment after
digestion with HindIII and BamHI. One plasmid which demonstrated the correct insert was designated
pMON42. Plasmid pMON42 DNA was prepared as described in previous examples.



Example 6: Creation of M13 Clone M-2


Seventy-five ug of plasmid pMON42 (described in Example 5) prepared from dam- E. coli cells were
digested with RsaI and BamHI (50 units of each, 3 hours, 37°C) and the 720 bp RsaI-BamHI fragment was
purified using NA-45 membrane. Eight ug of the purified 720 bp BamHI-RsaI fragment were digested with
MboI (10 min, 70°C), and the ends were made blunt by filling in with the large Klenow fragment of DNA
polymerase I and the four dNTPs. Then 0.1 ug of the resulting DNA mixture was added to 0.05 ug of M13
mp8 previously digested with SmaI (1 unit, 1 hour, 37°C) and calf alkaline phosphatase (0.2 units). After
ligation (10 units of T4 DNA ligase, 16 hours, 12°C) and transfection of E. coli JM101 cells, several hundred
recombinant phage were obtained. Duplex RF DNA was prepared from twelve recombinant phage-carrying
clones. The RF DNA (0.1 ug) was cleaved with EcoRI (1 unit, 1 hour, 37°C), end-labeled with 32P-dATP
and Klenow polymerase, and re-digested with BamHI (1 unit, 1 hour, 37°C). The EcoRI and BamHI sites
span the SmaI site. Therefore, clones containing the 260 bp MboI fragment could be identified as yielding a
labelled 270 bp fragment after electrophoresis on 6% polyacrylamide gels and autoradiography. Four of the
twelve clones carried this fragment. The orientation of the insert was determined by digestion of the
EcoRI-cleaved, end-labeled RF DNA (0.1 ug) with HinfI (1 unit, 1 hour, 37°C). HinfI cleaves the 260 bp MboI
fragment once 99 bp from the 3' end of the fragment and again 42 bp from the end nearest the NOS
coding region. Two clones of each orientation were obtained. One clone, designated as M-2 as shown in
Figure 8, contained the 260 bp fragment with the EcoRI site at the 3' end of the fragment. M-2 RF DNA was
prepared using the procedures of Messing et al, 1981.



Example 7: Creation of pMON75 and pMON76

Fifty ug of M-2 RF DNA (described in Example 6) were digested with 50 units of EcoRI and 50 units of
BamHI for 2 hours at 37°C. The 270 bp fragment (1 ug) was purified using agarose gel and NA-45
membrane. Plasmid pMON58 (described in Example 4) was digested with EcoRI and BamHI (50 ug, 50
units each, 2 hours, 37°C) and the 1300 bp fragment was purified using NA-45 membrane. The 270 bp
EcoRI-BamHI (0.1 ug) and 1300 bp EcoRI-BamHI (0.5 ug) fragments were mixed and treated with T4 DNA
ligase (2 units) for 12 hours at 14°C. After heating at 70°C for 10 minutes to inactivate the ligase, the
mixture was treated with EcoRI (10 units) for 1 hour at 37°C, then heated to 70°C for 10 minutes to
inactivate the EcoRI. This completed the assembly of a chimeric NOS-NPT II-NOS gene on a 1.8 kb
fragment, as shown on Figure 9.
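
As a worked check of the amounts used in the ligation above, the masses of the two fragments can be converted to molar quantities. This is a sketch only; the figure of 650 g/mol per base pair is a commonly used approximation and is an assumption here, and the function name is illustrative. The molar ratio itself does not depend on that constant.

```python
AVG_BP_MASS = 650.0  # g/mol per base pair of duplex DNA (approximation, assumed)


def picomoles(mass_ug, length_bp):
    """Convert a mass of duplex DNA (ug) of given length (bp) to picomoles."""
    return mass_ug * 1e6 / (length_bp * AVG_BP_MASS)


small_pmol = picomoles(0.1, 270)    # 270 bp EcoRI-BamHI fragment
large_pmol = picomoles(0.5, 1300)   # 1300 bp EcoRI-BamHI fragment
molar_ratio = small_pmol / large_pmol

print(f"270 bp: {small_pmol:.2f} pmol, 1300 bp: {large_pmol:.2f} pmol, "
      f"ratio {molar_ratio:.2f}:1")
```

On this estimate the two fragments are present in nearly equimolar amounts, even though their masses differ five-fold.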

Plasmid pMON38 is a clone of the pTiT37 HindIII-23 fragment inserted in the HindIII site of pBR327
(Soberon et al, 1980). pMON38 DNA (20 ug) was digested with EcoRI (20 units, 2 hours, 37°C) and calf
alkaline phosphatase (0.2 units, 1 hour, 37°C). The pMON38 DNA reaction was extracted with phenol,
precipitated with ethanol, dried and resuspended in 20 ul of 10 mM Tris-HCl, 1 mM EDTA, pH 8.



0.2 ug of the cleaved pMON38 DNA was added to the chimeric gene mixture described above. The mixture
was treated with T4 DNA ligase (4 units, 1 hour, 22°C) and mixed with Rb chloride-treated E. coli C600
recA56 cells to obtain transformation. After plating with selection for ampicillin-resistant (200 ug/ml) colonies,
63 potential candidates were obtained. Alkaline mini-preps of plasmid DNA were made from 12 of these and
screened by restriction endonuclease digestion for the proper constructs. Plasmid DNA's that contained a
1.5 kb EcoRI fragment and a new BglII site were digested with BamHI to determine the orientation of the
1.5 kb EcoRI fragment. One of each insert orientation was picked. One plasmid was designated pMON75
and the other pMON76, as shown in Figure 9. DNA from these plasmids was prepared as described in
previous examples.

Example 8: Creation of plasmids pMON128 and pMON129

The 1.5 kb EcoRI fragment was excised by EcoRI digestion from either pMON75 or pMON76 and
purified after agarose gel electrophoresis as described in previous examples. Five ug of DNA from plasmid
pMON120 (described in a separate application, "Plasmids for Transforming Plant Cells," (WO84/02919)
cited previously) was digested with EcoRI and treated with calf alkaline phosphatase. After phenol
deproteinization and ethanol precipitation, the EcoRI-cleaved pMON120 linear DNA was mixed with 0.5 ug
of the 1.5 kb EcoRI chimeric gene fragment. The mixture was treated with 2 units of T4 DNA ligase for 1
hour at 22°C. After transformation of E. coli cells (Maniatis et al, 1982) and selection of colonies resistant to
spectinomycin (50 ug/ml), several thousand colonies appeared. Six of these were picked, grown, and
plasmid mini-preps made. The plasmid DNA's were digested with EcoRI to check for the 1.5 kb chimeric
gene insert and with BamHI to determine the orientation of the insert. BamHI digestion showed that in
pMON128 the chimeric gene was transcribed in the same direction as the intact nopaline synthase gene of
pMON120. The orientation of the insert in pMON129 was opposite that in pMON128; the appearance of an
additional 1.5 kb BamHI fragment in digests of pMON129 showed that plasmid pMON129 carried a tandem
duplication of the chimeric NOS-NPT II-NOS gene, as shown in Figure 10.
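
The reasoning behind the tandem-duplication diagnosis can be sketched numerically: when a repeat unit carrying one BamHI site is present twice head-to-tail, digestion releases a fragment equal in length to the repeat unit. In the sketch below only the 1.5 kb insert length comes from the text; the vector length, the insertion point, and all BamHI positions are hypothetical values chosen for illustration.

```python
def circular_digest(plasmid_len, cut_sites):
    """Fragment sizes (bp, largest first) from cutting a circular plasmid."""
    sites = sorted(set(cut_sites))
    if not sites:
        return [plasmid_len]
    fragments = [b - a for a, b in zip(sites, sites[1:])]
    # The last fragment wraps around the origin of the circular map.
    fragments.append(plasmid_len - sites[-1] + sites[0])
    return sorted(fragments, reverse=True)


VECTOR_LEN = 8000      # hypothetical vector size
VECTOR_BAMHI = 1000    # hypothetical BamHI site in the vector
INSERTION_POINT = 3000 # hypothetical position of the EcoRI cloning site
INSERT_LEN = 1500      # 1.5 kb EcoRI chimeric gene fragment (from the text)
INSERT_BAMHI = 400     # hypothetical BamHI offset within the insert


def bamhi_pattern(copies):
    """BamHI fragment sizes for a plasmid with head-to-tail insert copies."""
    sites = [VECTOR_BAMHI] + [
        INSERTION_POINT + k * INSERT_LEN + INSERT_BAMHI for k in range(copies)
    ]
    return circular_digest(VECTOR_LEN + copies * INSERT_LEN, sites)


single = bamhi_pattern(1)
tandem = bamhi_pattern(2)
print("single insert:", single)
print("tandem duplication:", tandem)
```

With these illustrative coordinates the tandem construct shows the same fragments as the single-insert plasmid plus one additional fragment of exactly the insert length, which matches the observation reported for pMON129.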


Example 9: Creation of Plasmid pMON156

Plasmids which contained CaMV DNA were a gift to Monsanto Company from Dr. R. J. Shepherd,
University of California, Davis. To the best of Applicants' knowledge and belief, these plasmids (designated
as pOS1) were obtained by inserting the entire genome of a CaMV strain designated as CM4-184 (Howarth
et al, 1981) into the Sal I restriction site of a pBR322 plasmid (Bolivar et al, 1977). E. coli cells transformed
with pOS1 were resistant to ampicillin (AmpR) and sensitive to tetracycline (TetS).

Various strains of CaMV suitable for isolation of CaMV DNA which can be used in this invention are
publicly available; see, e.g., ATCC Catalogue of Strains II, p. 387 (3rd edition, 1981).

pOS1 DNA was cleaved with HindIII. Three small fragments were purified after electrophoresis on an
0.8% agarose gel using NA-45 membrane (Schleicher and Schuell, Keene NH). The smallest fragment,
about 500 bp in size, contains the 19S promoter. This fragment was further purified on a 6% acrylamide
gel. After various manipulations which did not change the sequence of this fragment (shown in Figure 28), it
was digested with MboI to create a 455 bp HindIII-MboI fragment. This fragment was mixed with a 1250 bp
fragment obtained by digesting pMON75 (described in Example 7 and shown in Figure 9) with BglII and
EcoRI. The 1250 bp fragment contains the NPTII structural sequence and the NOS 3' non-translated region.
The two fragments were ligated together by their compatible MboI and BglII overhangs to create a fragment
containing the CaMV(19S)-NPTII-NOS chimeric gene. This fragment was inserted into pMON120 which had
been cleaved with HindIII and EcoRI. The resulting plasmid was designated as pMON156, as shown in
Figure 29.
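
The compatibility of MboI and BglII ends relied on above follows from the cut positions of the two enzymes: both leave the same four-base 5' overhang. A minimal sketch of that check is given below. The recognition sites and cut positions are the standard ones for these enzymes; the dictionary layout and function names are illustrative only.

```python
# Recognition site (5'->3') and cut offset on the top strand.
# All four enzymes are palindromic cutters that leave 5' overhangs.
ENZYMES = {
    "MboI": ("GATC", 0),     # ^GATC
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
    "EcoRI": ("GAATTC", 1),  # G^AATTC
}


def overhang(enzyme):
    """Single-stranded 5' overhang left by a palindromic 5'-overhang cutter."""
    site, cut = ENZYMES[enzyme]
    # The bottom strand is cut at the mirror-image position.
    return site[cut:len(site) - cut]


def compatible(enzyme_a, enzyme_b):
    """True if ends produced by the two enzymes can anneal to each other."""
    return overhang(enzyme_a) == overhang(enzyme_b)


print(overhang("MboI"), overhang("BglII"), compatible("MboI", "BglII"))
print(overhang("EcoRI"), compatible("BglII", "EcoRI"))
```

The same check explains why BamHI and BglII ends can also be joined, as is done in Example 11.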

Plasmid pMON156 was inserted into E. coli cells and subsequently into A. tumefaciens cells where it
formed a co-integrate Ti plasmid having the CaMV(19S)-NPT II-NOS chimeric gene surrounded by T-DNA
borders. A. tumefaciens cells containing the co-integrate plasmids were co-cultivated with petunia cells. The
co-cultivated petunia cells were cultured on media containing kanamycin. Some of the co-cultivated petunia
cells survived and produced colonies on media containing up to 50 ug/ml kanamycin. This indicated that the
CaMV(19S)-NPT II-NOS genes were expressed in petunia cells. These results were confirmed by Southern
blot analysis of transformed plant cell DNA.



Example 10: Creation of pMON155 

Plasmid pMON72 was obtained by inserting a 1.8 kb HindIII-BamHI fragment from bacterial transposon
Tn5 (which contains an NPTII structural sequence) into a PstI- pBR327 plasmid digested with HindIII and
BamHI. This plasmid was digested with BglII and PstI to remove the NPTII structural sequence.

Plasmid pMON1001 (described in Example 1 and shown in Figure 6) from dam- cells was digested with
BglII and PstI to obtain a 218 bp fragment with a partial NPTII structural sequence. This fragment was
digested with MboI to obtain a 194 bp fragment.

A triple ligation was performed using (a) the large PstI-BglII fragment of pMON72; (b) the PstI-MboI
fragment from pMON1001; and (c) a synthetic linker with BglII and MboI ends having stop codons in all
three reading frames. After transformation of E. coli cells and selection for ampicillin resistant colonies,
plasmid DNA from AmpR colonies was analyzed. A colony containing a plasmid with the desired structure
was identified. This plasmid was designated pMON110, as shown on Figure 30.
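
The property "stop codons in all three reading frames" can be checked mechanically. The sketch below scans a strand in each frame for TAA, TAG and TGA. The test sequence is the upper strand of the synthetic linker as it can be read from the extracted text of the Figure 30 drawing sheet; since that reading comes from a scanned figure, it should be treated as illustrative rather than authoritative. The function names are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def stops_by_frame(seq):
    """Map each reading frame (0, 1, 2) to the stop codons found in it."""
    seq = seq.upper()
    found = {}
    for frame in range(3):
        codons = [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]
        found[frame] = [codon for codon in codons if codon in STOP_CODONS]
    return found


def stops_in_all_frames(seq):
    """True if every one of the three forward frames contains a stop codon."""
    return all(stops_by_frame(seq).values())


# Upper strand of the linker as transcribed from the figure (illustrative).
LINKER_TOP = "GATCTAGTTAGTTAATCTAGAC"

print(stops_by_frame(LINKER_TOP))
print(stops_in_all_frames(LINKER_TOP))
```

A linker of this kind terminates translation from any upstream start codon regardless of the frame in which ribosomes enter it.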

In order to add the 3' end of the NPT II structural sequence to the 5' portion in pMON110, pMON110
was treated with XhoI. The resulting overhanging end was filled in to create a blunt end by treatment with
Klenow polymerase and the four deoxy-nucleotide triphosphates (dNTP's), A, T, C, and G. The Klenow
polymerase was inactivated by heat, the fragment was digested with PstI, and a 3.6 kb fragment was
purified. Plasmid pMON76 (described in Example 7 and shown in Figure 9) was digested with HindIII, filled
in to create a blunt end with Klenow polymerase and the four dNTP's, and digested with PstI. An 1100 bp
fragment was purified, which contained part of the NPT II structural sequence, and a nopaline synthase
(NOS) 3' non-translated region. This fragment was ligated with the 3.6 kb fragment from pMON110. The
mixture was used to transform E. coli cells; AmpR cells were selected, and a colony having a plasmid with
the desired structure was identified. This plasmid was designated pMON132, as shown on Figure 31.
Plasmid pMON93 (shown on Figure 28) was digested with HindIII, and a 476 bp fragment was isolated. This
fragment was digested with MboI, and a 455 bp fragment was purified which contained the CaMV (19S)
promoter region and 5' non-translated region. Plasmid pMON132 was digested with EcoRI and BglII to
obtain a 1250 bp fragment with (1) the synthetic linker equipped with stop codons in all three reading
frames; (2) the NPT II structural sequence; and (3) the NOS 3' non-translated region. These two fragments
were joined together through the compatible MboI and BglII ends to create a CaMV (19S)-NPT II-NOS
chimeric gene.

This gene was inserted into pMON120, which was digested with HindIII and EcoRI, to create plasmid
pMON155, as shown in Figure 32.

Plasmid pMON155 was inserted into A. tumefaciens GV3111 cells containing a Ti plasmid, pTiB6S3.
The pMON155 plasmid formed a co-integrate plasmid with the Ti plasmid by means of a single crossover
event. Cells which contain this co-integrate plasmid have been deposited with the American Type Culture
Collection, and have been assigned ATCC accession number 39338. A fragment which contains the
chimeric gene of this invention can be obtained by digesting the co-integrate plasmid with HindIII and
EcoRI, and purifying the 1.7 kb fragment. These cells have been used to transform petunia cells, allowing
the petunia cells to grow on media containing at least 100 ug/ml kanamycin.






EP 0 131 623 B1 



Example 11: Creation of pMON183 and 184

Plasmid pOS1 (described in Example 9) was digested with BglII, and a 1200 bp fragment was purified.
This fragment contained the 32S promoter region and part of the 5' non-translated region. It was inserted
into plasmid pSHL72 which had been digested with BamHI and BglII (pSHL72 is functionally equivalent to
pAG060, described in Colbere-Garapin et al, 1981). The resulting plasmid was designated as pMON50, as
shown on Figure 33.

The cloned BglII fragment contains a region of DNA that acts as a polyadenylation site for the 32S RNA
transcript. This polyadenylation region was removed as follows: pMON50 was digested with AvaII and an
1100 bp fragment was purified. This fragment was digested with EcoRI and EcoRV. The resulting 190 bp
EcoRI-EcoRV fragment was purified and inserted into plasmid pBR327, which had been digested with
EcoRI and EcoRV. The resulting plasmid, pMON81, contains the CaMV 32S promoter on a 190 bp
EcoRV-EcoRI fragment, as shown on Figure 33.

To make certain the entire promoter region of CaMV(32S) was present in pMON81, a region adjacent to
the 5' (EcoRV) end of the fragment was inserted into pMON81 in the following way. Plasmid pMON50
prepared from dam- cells was digested with EcoRI and BglII and the resultant 1550 bp fragment was
purified and digested with MboI. The resulting 725 bp MboI fragment was purified and inserted into the
unique BglII site of plasmid pKC7 (Rao and Rogers, 1979) to give plasmid pMON125, as shown in Figure
34. The sequence of bases adjacent to the two MboI ends regenerates BglII sites and allows the 725 bp
fragment to be excised with BglII.
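
Whether a BglII site is regenerated when an MboI fragment is ligated into a BglII-cut vector depends on the base lying just inside each GATC of the fragment. A minimal sketch of that test follows. The two example sequences are hypothetical and are not taken from CaMV; the function name is illustrative.

```python
BGLII_SITE = "AGATCT"


def regenerated_bglii(fragment_top):
    """Check both junctions of an MboI fragment cloned into a BglII site.

    fragment_top: top strand of the fragment as it reads once ligated,
    from the GATC at its left end through the GATC at its right end.
    Returns (left_junction_is_BglII, right_junction_is_BglII).
    """
    if not (fragment_top.startswith("GATC") and fragment_top.endswith("GATC")):
        raise ValueError("an MboI fragment must begin and end with GATC")
    # The BglII-cut vector contributes an A upstream and a T downstream.
    left_junction = "A" + fragment_top[:5]
    right_junction = fragment_top[-5:] + "T"
    return left_junction == BGLII_SITE, right_junction == BGLII_SITE


# Hypothetical fragments for illustration.
both_regenerated = regenerated_bglii("GATCT" + "ACGTACGT" + "AGATC")
none_regenerated = regenerated_bglii("GATCC" + "ACGTACGT" + "GGATC")
print(both_regenerated, none_regenerated)
```

Both sites are restored only when the fragment reads GATCT at one end and AGATC at the other, which is the situation the text reports for the 725 bp fragment.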

To generate a fragment carrying the 32S promoter, the 725 bp BglII fragment was purified from
pMON125 and was subsequently digested with EcoRV and AluI to yield a 190 bp fragment. Plasmid
pMON81 was digested with BamHI, treated with Klenow polymerase and digested with EcoRV. The 3.1 kb
EcoRV-BamHI(blunt) fragment was purified, mixed with the 190 bp EcoRV-AluI fragment and treated with
DNA ligase. Following transformation and selection of ampicillin-resistant cells, plasmid pMON172 was
obtained which carries the CaMV(32S) promoter sequence on a 380 bp BamHI-EcoRI fragment, as shown
on Figure 35. This fragment does not carry the poly-adenylation region for the 32S RNA. Ligation of the AluI
end to the filled-in BamHI site regenerates the BamHI site.
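
The regeneration of the BamHI site at a blunt junction can be verified by writing out the ends. BamHI cuts G^GATCC, so filling in leaves one arm ending in GGATC and the other beginning with GATCC; AluI cuts AG^CT to give blunt ends. The sketch below joins the ends in both possible arrangements. It illustrates the principle only and does not model the actual pMON172 sequence.

```python
BAMHI_SITE = "GGATCC"

# Blunt ends after BamHI digestion and Klenow fill-in (top strand, 5'->3').
FILLED_BAMHI_LEFT_ARM = "GGATC"   # ...GGATC|
FILLED_BAMHI_RIGHT_ARM = "GATCC"  # |GATCC...

# Blunt ends after AluI digestion (AG^CT).
ALUI_UPSTREAM_HALF = "AG"    # fragment ending ...AG|
ALUI_DOWNSTREAM_HALF = "CT"  # fragment beginning |CT...


def regenerates_bamhi(junction):
    """True if the ligated junction sequence contains a BamHI site."""
    return BAMHI_SITE in junction


# Case 1: vector arm followed by a fragment that begins with CT.
junction_vector_then_fragment = FILLED_BAMHI_LEFT_ARM + ALUI_DOWNSTREAM_HALF
# Case 2: a fragment that ends with AG followed by the vector arm.
junction_fragment_then_vector = ALUI_UPSTREAM_HALF + FILLED_BAMHI_RIGHT_ARM

print(junction_vector_then_fragment, regenerates_bamhi(junction_vector_then_fragment))
print(junction_fragment_then_vector, regenerates_bamhi(junction_fragment_then_vector))
```

In either arrangement the AluI end supplies the base needed to complete GGATCC, so the junction can be recut with BamHI.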

To rearrange the restriction endonuclease sites adjacent to the CaMV(32S) promoter, the 380 bp
BamHI-EcoRI fragment was purified from pMON172, treated with Klenow polymerase, and inserted into the
unique SmaI site of phage M13 mp8. One recombinant phage, M12, carried the 380 bp fragment in the
orientation shown on Figure 36. The replicative form DNA from this phage carries the 32S promoter
fragment on an EcoRI(5')-BamHI(3') fragment.

Plasmids carrying a chimeric gene (CaMV(32S) promoter region-NPT II structural sequence-NOS 3'
non-translated region) were assembled as follows. The 380 bp EcoRI-BamHI CaMV (32S) promoter
fragment was purified from phage M12 RF DNA and mixed with the 1250 bp BglII-EcoRI NPT II-NOS
fragment from pMON75. Joining of these two fragments through their compatible BamHI and BglII ends
results in a 1.8 kb CaMV(32S)-NPT II-NOS chimeric gene. This gene was inserted into pMON120 at the
EcoRI site in both orientations. The resultant plasmids, pMON183 and 184, appear in Figure 37. These
plasmids were used to transform petunia cells. The transformed cells are capable of growth on media
containing 100 ug/ml kanamycin.
Claims

1. A chimeric gene capable of expressing a polypeptide in plant cells comprising in sequence:
(a) a promoter region from a gene which is naturally expressed in plant cells;
(b) a 5' non-translated region;
(c) a structural coding sequence encoding a neomycin phosphotransferase polypeptide; and
(d) a 3' non-translated region of a gene naturally expressed in plant cells, said region encoding a
signal sequence for polyadenylation of mRNA; said promoter being heterologous with respect to the
structural coding sequence.

2. A gene of Claim 1 in which the promoter is selected from a gene of the group consisting of a nopaline
synthase gene and a ribulose-1,5-bis-phosphate carboxylase small subunit gene.

3. A gene of Claim 1 in which the 3' non-translated region is selected from a gene from the group
consisting of the genes from the T-DNA region of Agrobacterium tumefaciens.

4. A gene of Claim 1 or 2 in which the 3' non-translated region is from the nopaline synthase gene of
Agrobacterium tumefaciens.

5. A chimeric gene capable of expressing a polypeptide in plant cells comprising in sequence:
(a) a promoter region from a plant virus;
(b) a 5' non-translated region;
(c) a structural coding sequence;
(d) a 3' non-translated region of a gene naturally expressed in plants, said region encoding a signal
sequence for polyadenylation of mRNA, said structural coding sequence being heterologous with
respect to said promoter region.

6. A gene of claim 5 in which the promoter is from cauliflower mosaic virus.

7. A gene of claim 6 in which the 3' non-translated region is from a nopaline synthase gene.

8. A gene of claim 5 in which the promoter is the full-length transcript promoter of cauliflower mosaic
virus.

9. A gene of claim 8 in which the 3' non-translated region is from a nopaline synthase gene.

10. A culture of microorganisms identified by ATCC accession number 39265.



Claims (translation of the French text, "Revendications")

1. A chimeric gene capable of expressing a polypeptide in plant cells, comprising in succession:
(a) a promoter region from a gene which is naturally expressed in plant cells;
(b) a 5' non-translated region;
(c) a structural coding sequence encoding a polypeptide, neomycin phosphotransferase; and
(d) a 3' non-translated region of a gene naturally expressed in plant cells, this region encoding a
signal sequence for polyadenylation of mRNA;
this promoter being heterologous with respect to the structural coding sequence.

2. A gene according to claim 1, in which the promoter is chosen from the nopaline synthase gene and a
gene for the small subunit of ribulose-1,5-bis-phosphate carboxylase.

3. A gene according to claim 1, in which the 3' non-translated region is chosen from the genes of the
T-DNA region of Agrobacterium tumefaciens.

4. A gene according to claim 1 or 2, in which the 3' non-translated region comes from the nopaline
synthase gene of Agrobacterium tumefaciens.

5. A chimeric gene capable of expressing a polypeptide in plant cells, comprising in succession:
(a) a promoter region of a plant virus;
(b) a 5' non-translated region;
(c) a structural coding sequence;
(d) a 3' non-translated region of a gene naturally expressed in plants, this region encoding a signal
sequence for polyadenylation of mRNA; this structural coding sequence being heterologous with
respect to this promoter region.

6. A gene according to claim 5, in which the promoter is that of cauliflower mosaic virus.

7. A gene according to claim 6, in which the 3' non-translated region comes from a nopaline synthase
gene.

8. A gene according to claim 5, in which the promoter is the full-length transcript promoter of
cauliflower mosaic virus.

9. A gene according to claim 8, in which the 3' non-translated region comes from a nopaline synthase
gene.

10. A culture of microorganisms, identified by ATCC accession number 39265.

Claims (translation of the German text, "Ansprüche")

1. A chimeric gene which can express a polypeptide in plant cells and comprises in sequence:
(a) a promoter region from a gene which is naturally expressed in plant cells;
(b) a 5' non-translated region;
(c) a structural coding sequence which encodes a neomycin phosphotransferase polypeptide; and
(d) a 3' non-translated region of a gene naturally expressed in plant cells, which region encodes a
signal sequence for polyadenylation of mRNA; the promoter being heterologous with respect to the
structural coding sequence.

2. A gene according to claim 1, in which the promoter is selected from a gene of the group consisting of
a nopaline synthase gene and a ribulose-1,5-bisphosphate carboxylase small subunit gene.

3. A gene according to claim 1, in which the 3' non-translated region is selected from a gene of the group
consisting of the genes from the T-DNA region of Agrobacterium tumefaciens.

4. A gene according to claim 1 or 2, in which the 3' non-translated region is from the nopaline synthase
gene of Agrobacterium tumefaciens.

5. A chimeric gene which can express a polypeptide in plant cells and comprises in sequence:
(a) a promoter region from a plant virus;
(b) a 5' non-translated region;
(c) a structural coding sequence; and
(d) a 3' non-translated region of a gene naturally expressed in plant cells, which region encodes a
signal sequence for polyadenylation of mRNA; the structural coding sequence being heterologous
with respect to the promoter region.

6. A gene according to claim 5, in which the promoter is from cauliflower mosaic virus.

7. A gene according to claim 6, in which the 3' non-translated region is from a nopaline synthase gene.

8. A gene according to claim 5, in which the promoter is the full-length transcript promoter of cauliflower
mosaic virus.

9. A gene according to claim 8, in which the 3' non-translated region is from a nopaline synthase gene.

10. A culture of microorganisms identified by ATCC No. 39265.
[FIG. 2. Flow chart. Box labels, in the order extracted (reference numerals 42 to 60 could not be
reliably matched to boxes):
- Isolate fragment containing (NOS) plant promoter region, 5' non-translated region and start of
  structural sequence
- Isolate fragment containing heterologous (NPT II) structural sequence
- Remove structural sequence from fragment
- Insert fragment into plasmid with cleavage site near start codon (pMON 40)
- Isolate fragment with (NOS) plant promoter region and 5' non-translated region
- Cleave plasmid at site near heterologous start codon
- Insert (NOS) plant promoter fragment into plasmid (pMON 40); religate; select cells with fragment
  in proper orientation (pMON 56)
- Cleave plasmid (pMON 56), isolate fragment containing (NOS) plant promoter region and (NPT II)
  heterologous structural sequence
- Isolate fragment containing 3' non-translated region of plant (NOS) gene
- Ligate fragments together in proper orientation to obtain chimeric (NOS-NPT II-NOS) gene
- Insert chimeric gene into selected plasmid (pMON 120) to obtain chimeric plasmid (pMON 128,
  pMON 129)
- Insert chimeric plasmid into plant cells
- Insert chimeric plasmid into A. tumefaciens to create co-integrate Ti plasmid
- Infect plant cells with A. tumefaciens
- Identify transformed plant cells (box appears twice)]
[FIG. 5. Diagram labels: synthetic primer (5' 32P) annealed to single-stranded M13 cloned DNA; new
strand of DNA; Klenow polymerase, unlabeled dNTPs; T4 DNA polymerase, unlabeled dNTPs; digest with
EcoRI, purify 308 bp fragment; Sau3A.]
[FIG. 6. Plasmid construction diagram. Labels: phage lambda DNA carrying Tn5 DNA; digest with HindIII,
purify 3.3 kb fragment; digest with HindIII and BamHI; NPT II KmR gene; mix, ligate, transform cells,
select on Amp and Km; digest with SmaI; add BamHI linker 5'-CCGGATCCGG / GGCCTAGGCC-5'; ligate, digest
with BamHI; BamHI from linker; 500 bp fragment; purify large 4.2 kb fragment, ligate to circularize,
transform cells, select on Amp and Km. Restriction sites shown: EcoRI, HindIII, BamHI, SmaI.]
[FIG. 7. Diagram label: digest with BglII, convert to blunt ends with Klenow DNA polymerase + 4 dNTPs.]
[FIG. 8. Plasmid construction diagram. Labels: NOS stop codon; NOS poly-A signal; 1100 bp fragment from
HindIII-23 fragment; transform, select AmpR cells; prepare pMON 42 DNA from dam-3 cells, digest with
RsaI, BamHI, purify 720 bp fragment; digest with MboI, convert to blunt ends; digest with SmaI; mix,
treat with DNA ligase, transform cells, obtain phage plaques; screen RF DNAs for 260 bp insert; select
clone with proper orientation. Restriction sites shown: BamHI, RsaI, HindIII, MboI, SmaI, EcoRI, PstI,
SalI.]
[FIG. 9. Plasmid construction diagram. Labels: digest with EcoRI and BamHI, purify 1300 bp fragment;
digest with EcoRI and BamHI, purify 280 bp fragment; NOS promoter region and 5' non-translated region;
NPT II structural sequence and 3' non-translated region; NOS 3' non-translated region; NOS-NPT II
chimeric gene; mix and ligate; digest with EcoRI. Restriction sites shown: EcoRI, BglII, BamHI.]
[FIG. 10. Plasmid construction diagram. Labels: T-DNA right border; digest with EcoRI, purify 1.5 kb
fragment; NOS promoter region and 5' non-translated region; NPT II structural sequence; NOS 3'
non-translated region; T4 DNA ligase, transform cells, select SpcR cells; NOS-NPT II-NOS. Restriction
sites shown: EcoRI, BglII, BamHI.]
[FIG. 11. Plasmid construction diagram. Labels: digest with BamHI, AvaI, blunt ends with Klenow
polymerase + 4 dNTPs; digest with BamHI; mix, ligate, transform, select on Amp and Km; add synthetic
KpnI linker 5'-CCGGTACCGG / GGCCATGGCC-5'; ligate, transform, select AmpR cells; digest with AvaII,
blunt ends with Klenow polymerase + 4 dNTPs; add BamHI linker 5'-CCGGATCCGG / GGCCTAGGCC-5'.
Restriction sites shown: BamHI, AvaI, AvaII, XhoI.]
[FIG. 12. Plasmid construction diagram. Labels: digest with EcoRI, BamHI, purify 1.9 kb fragment;
digest with EcoRI, BamHI, purify 1300 bp fragment; NOS promoter and 5' non-translated regions; NPT II
structural sequence; mix, ligate, transform, select AmpR cells. Restriction sites shown: EcoRI, BamHI,
PstI.]
[FIG. 13. Plasmid construction diagram. Labels: NOS promoter, 5' non-translated regions; digest PstI,
BamHI, purify 2.4 kb fragment; digest XhoI, BamHI, purify 950 bp fragment; synthetic linker
5'-ATGAGCCATATTCAACGGAAACGTCTTGC / 3'-ACGTTACTCGGTATAAGTTGCCTTTGCAGAACGAGCT-5'; partial NPT I
structural sequence; mix, ligate, transform, select AmpR cells. Restriction sites shown: EcoRI, PstI,
BamHI, XhoI.]
[FIG. 14. Plasmid construction diagram. Labels: digest with EcoRI, BamHI, purify 1300 bp fragment; NOS
poly-A signal; digest with EcoRI, BamHI, purify 280 bp fragment; NOS promoter, 5' non-translated
regions; NPT I structural sequence; NOS 3' non-translated region. Restriction sites shown: EcoRI, XhoI,
BamHI.]
[FIG. 15. Plasmid construction diagram. Labels: digest with EcoRI, purify 1.6 kb fragment; NOS promoter,
5' non-translated regions; NPT I structural sequence; NOS 3' non-translated region; digest EcoRI, CAP;
mix, ligate, transform, select SpcR cells; NOS poly-A signal. Restriction sites shown: EcoRI, XhoI,
BamHI.]
[FIG. 17. Plasmid construction diagram. Labels: digest with BglII; soybean small subunit promoter
fragment; digest with MboI, purify 800 bp fragment; Sbss promoter, 5' non-translated regions; mix,
ligate, transform, select AmpR cells; MboI/BglII joint. Restriction sites shown: EcoRI, HindIII, BglII,
MboI, BclI.]
[FIG. 18. Plasmid construction diagram. Labels: NOS poly-A signal; digest EcoRI, BclI, purify 1200 bp
fragment; digest EcoRI, BglII, purify 1250 bp fragment; Sbss promoter, 5' non-translated regions; NPT II
structural sequence; NOS 3' non-translated region; digest with EcoRI; mix, ligate, transform, select
SpcR cells; nopaline synthase; pMON 141; BglII/BclI joint. Restriction sites shown: EcoRI, BclI, BglII,
BamHI, HindIII.]
[Figure (caption not recovered). Plasmid construction diagram. Labels: bGH start codon; digest with
HindIII, EcoRI, purify 570 bp fragment; digest HindIII, EcoRI, purify 290 bp fragment; BGH structural
sequence; digest EcoRI, CAP; NOS 3' non-translated region; mix, ligate, transform, select AmpR cells;
NOS poly-A signal. Restriction sites shown: EcoRI, ClaI, HindIII, BamHI.]
[FIG. 20. Plasmid construction diagram. Labels: digest with XbaI, create blunt ends with Klenow
polymerase + 4 dNTPs; digest with EcoRI, create blunt ends with Klenow polymerase + 4 dNTPs, purify 900
bp fragment; NOS 3' non-translated region; BGH structural sequence; mix, ligate, transform, select AmpR
cells; NOS poly-A signal. Restriction sites shown: EcoRI, HindIII, XbaI (blunt), EcoRI (blunt), ClaI,
BglII.]
[FIG. 21. Plasmid map. Labels: NOS poly-A signals; BglII/BclI joints. Restriction sites shown: ClaI,
HindIII.]
[FIG. 22. Plasmid construction diagram. Labels: digest with EcoRI, BglII, purify 308 bp fragment; digest
with ClaI, BglII, purify fragment (size not legible); NOS promoter, 5' non-translated regions; BGH
structural sequence; NOS 3' non-translated region; digest with ClaI, EcoRI, purify 8 kb fragment; mix,
ligate, transform, select SpcR cells; NOS poly-A signal. Restriction sites shown: ClaI, BglII, EcoRI,
HindIII.]
[FIG. 23. Plasmid construction diagram. Labels: digest with ClaI, purify 2.5 kb fragment; EPSP synthase
structural sequence; digest ClaI, treat with alkaline phosphatase; mix, ligate, transform, select AmpR
cells. Restriction sites shown: ClaI, HindIII, BstEII, NdeI, EcoRI, BamHI.]
[FIG. 24. Plasmid construction diagram. Labels: digest with NdeI, BamHI, purify 4.9 kb fragment; EPSP
synthase structural sequence; pMON 8; digest with NdeI, BstEII, purify; EPSP synthase structural
sequence 5' portion, start codon; digest with HinfI; synthetic linker (sequence not reliably legible in
the extracted text) carrying BglII and XbaI sites and the EPSP synthase start codon; mix, ligate,
transform, select AmpR cells; pMON 25. Restriction sites shown: EcoRI, HindIII, BstEII, BamHI, NdeI,
HinfI.]
[FIG. 25. Plasmid construction diagram. Labels: pMON 25, EPSP synthase structural sequence; digest with
EcoRI, HindIII; NOS poly-A signal; digest with EcoRI, HindIII, purify 290 bp fragment; NOS 3'
non-translated region; mix, ligate, transform, select AmpR cells. Restriction sites shown: EcoRI,
HindIII, ClaI, BglII, BamHI.]
[FIG. 26. Plasmid construction diagram. Labels: NOS poly-A signal; digest with ClaI, BglII, purify 2.3
kb fragment; digest with EcoRI, BglII, purify 308 bp fragment; nopaline synthase; pMON120, SpcR; NOS 3'
non-translated region; EPSP synthase structural sequence; NOS promoter, 5' non-translated regions;
digest ClaI, EcoRI, treat with CAP; mix, ligate, transform, select SpcR cells. Restriction sites shown:
EcoRI, ClaI, HindIII, BglII.]
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ClQl 



FIG. 27. [Plasmid construction diagram; only the text labels survive. Legible steps: pMON 146 (synthase structural sequence; host strain label garbled, possibly dam- cells), digest with Cla I and Bgl II, purify 2.3 kb fragment; digest with Cla I and Bcl I, purify 1.1 kb fragment; digest with Cla I and EcoRI, treat with CAP; mix, ligate, transform, select SpcR cells. Insert map labels: SpcR; NOS 3' non-translated region; EPSP synthase structural sequence; Sbss promoter, 5' non-translated regions. Other labels: NOS poly-A signal; Bgl II/Bcl I joint. Sites shown: Cla I, Hind III, EcoRI, Bgl II, Bcl I.]



FIG. 29. [Plasmid construction diagram; only the text labels survive. Legible steps: pMON 93 (19S promoter, 5' regions; host strain label garbled, possibly dam- cells), digest with Hind III, purify 476 bp fragment; digest with Mbo I, purify 435 bp fragment; digest with EcoRI and Bgl II, purify 1250 bp fragment; digest with EcoRI and Hind III, treat with CAP; mix, ligate, transform, select SpcR cells. Insert map labels: CaMV 19S promoter, 5' regions; NPT II structural sequence; NOS 3' non-translated region. Other labels: NOS poly-A signal; Mbo I/Bgl II joint. Sites shown: Hind III, Sac I, Mbo I, EcoRI, Bam HI, Bgl II.]



FIG. 30. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with Pst I and Bgl II, purify 3.4 kb fragment; digest with Bgl II and Pst I, purify 218 bp fragment; digest with Mbo I, purify 194 bp fragment; mix, ligate, transform, select AmpR cells. Synthetic linker containing an Xba I site, strands as printed left to right: upper GATCTAGTTAGTTAATCTAGAC, lower ATCAATCAATTAGATCTGCTAG. Other labels: partial NPT II structural sequence; spurious ATG. Sites shown: Bgl II, Pst I, Bam HI, Xho I, Xba I, Mbo I.]



FIG. 31. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with Xba I (printed "Xbol"; Xba I and Xho I are hard to tell apart in the scan), fill in with Klenow polymerase + 4 dNTPs, digest with Pst I, purify 3.6 kb fragment; digest with Hind III, fill in with Klenow polymerase + 4 dNTPs, digest with Pst I, purify 1100 bp fragment; mix, ligate, transform, select AmpR cells. Other labels: 3' portion of NPT II structural sequence; NOS 3' region; Hind III (blunt); Xho I (blunt). Sites shown: Xho I, Bgl II, Xba I, Pst I, Bam HI, EcoRI, Hind III.]



FIG. 32. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with Hind III, purify 476 bp fragment; digest with Mbo I, purify 453 bp fragment; digest with EcoRI and Bgl II, purify 1230 bp fragment; digest with EcoRI and Hind III, treat with CAP; mix, ligate, transform, select SpcR cells. Insert map labels: CaMV 19S promoter, 5' regions; modified NPT II structural sequence; NOS 3' non-translated region. Other labels: nopaline synthase; NOS poly-A signal. Sites shown: Hind III, Sac I, Mbo I, Bgl II, Xba I, EcoRI, Bam HI.]



FIG. 33. [Plasmid construction diagram; only the text labels survive, and several enzyme names are garbled. Legible steps: digest with Bgl II, purify 1200 bp fragment; digest with Bam HI and a second enzyme (illegible, probably Bgl II), purify large fragment; mix, ligate, transform, select AmpR cells; digest with Ava II; digest with two EcoR enzymes (printed inconsistently, probably EcoRI* and EcoRV), purify 190 bp fragment; purify 1100 bp fragment; mix, ligate, transform, select AmpR cells; digest with the two EcoR enzymes, purify 3.1 kb fragment. Other labels: pSHL72 (as printed); CaMV DNA; poly-A site; CaMV (32S) promoter, 5' leader region; 5' non-translated regions; AmpR; pMON 81. Sites shown: Sal I, Bgl II, EcoRI, Bam HI, Ava II.]




FIG. 34. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with EcoRI and Bgl II; purify 1550 bp fragment; digest with Mbo I, purify 725 bp fragment; mix, ligate, transform, select AmpR cells. Other labels: CaMV 32S promoter, 5' non-translated regions; pMON 125 (AmpR; 32S promoter, 5' non-translated regions). Sites shown: EcoRI, Bgl II, Mbo I, and a second EcoR site (printed "EcoRX" or "EcoRT", probably EcoRV).]




FIG. 35. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with Bam HI, fill in with Klenow polymerase + 4 dNTPs; digest with an EcoR enzyme (printed "EcoRT", probably EcoRV), purify 3.1 kb fragment; digest with Bgl II, purify 725 bp fragment; digest with the same EcoR enzyme and Alu I, purify 190 bp fragment; mix, ligate, transform, select AmpR cells. Sites shown: EcoRI, Bam HI, Bgl II, Alu I.]



FIG. 36. [Plasmid construction diagram; only the text labels survive. Legible steps: digest with EcoRI and Bam HI, fill in ends with Klenow polymerase + 4 dNTPs, purify 380 bp fragment; M13 mp8 RF, digest with Sma I, treat with CAP; mix, ligate, transform, select AmpR cells. Sites shown: EcoRI, Sma I, Bam HI.]



[Plasmid construction diagram; only the text labels survive. Legible steps: digest with EcoRI and Bgl II, purify 1250 bp fragment. Other labels: CaMV 32S promoter, 5' non-translated region; NPT II structural sequence; NOS 3'; NOS poly-A signal; Bgl II/Bam HI joint. Sites shown: EcoRI, Bam HI, Bgl II.]